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1. Introduction ’ 


The algebraic theory’ of associative central simple algebras is concerned chiefly 
with the automorphisms of such algebras, with their subalgebras, with the condi- 
tions that they be division algebras, and with the conditions that two differently 
generated algebras be isomorphic. These topics are surely of equal importance 
in the theory of nonassociative central simple algebras and there is, in addition, 
the study of the conditions that two algebras be isotopic. 

A study of these topics in the nonassociative case has only been made for such 
algebras as Lie algebras, Jordan algebras, and Cayley algebras. There are cen- 
tral simple algebras which are much more like the associative crossed product 
algebras than are any of the above types and no study has yet been made of these 
topics for them. It is natural then to begin this new study with the discussion 
of an interesting special case. 

We shall consider here certain algebras of order four over a general field §. 
They are a generalization of the “generalized quaternion” algebras of L. E. 
Dickson which have proved to be such an important special case in the associative 
theory, and specialize to the associative case when a single parameter +r = 0. 
The algebras are not associative if and only if + ~ 0, 1 and are central simple.” 

We shall show that the theory of isomorphism, automorphisms, proper sub- 
algebras, and division algebras for these algebras is actually much simpler than 
in the associative case, a result quite contrary to what might have been expected. 
Indeed, the theory is remarkably like the theory of quadratic fields. We shall 

also obtain a complete study of the conditions that two of our quasiquaternion 
division algebras over a field of characteristic not two be isotopic.’ 


2. The multiplications of a crossed extension‘ 


We consider an algebra 3 with a unity quantity e¢ and let n be the order of 3 
over the base field §. We call the linear transformations a — ax on 3 its right 
multiplications p, and the transformations a — za its left multiplications d. . 
Then x is non-singular if both Ps and i, are nonsingular, that is, are one-to-one. 


‘ef. Chapter IV of my Structure of Algebras, A. M. S. Colloquium Publications vol. 24, 
New York, 1939. 

* Our algebras are permutation algebras and hence are central simple. See my Non- 
associative algebras II. New Simple Algebras, these Annals vol. 43 (1942), pp. 708-723. 

* A student of mine at the University of Chicago is engaged in a study of the arithmetics 
: ag algebras and I expect to have the isotopy theory completed by another 
student. 

‘The study of crossed extensions is the subject of the paper referred to in footnote 2. 

present section is an addition to that paper. 
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A crossed extension & = (3, G, 5, g) of 3 by G and § relative to the extension 
set g is an algebra of order mn over §. It contains 3 as a subalgebra and has ¢ 
as its unity quantity. The set G is a group of m nonsingular transformations on 
3 such that eS — e for every S of G. The setg = {gs,r} consists of nonsingular 
quantities gs, 7 of 3 with one such quantity for every pair S, T of G. The algebra 
% is the supplementary sum of the m linear subspaces js3 for S in G, and j,;3 = 
3 where I is the identity transformation. ‘The set 5 is any subset of G which 
contains J, Multiplication in %f is now defined by the distributive laws and by 


(1) (jsa)(jrx) = T, 2). 


The function ¢ is the product of the three factors a7’, x and gs,r. The factors 
aT and x are multiplied in the order (a7’)x or x(aT) according as S is or is not in 
, and their product is multiplied on the right or left by gs, 7 according as ST is 
or is notin . Then (jsa)(jrx) = jsraR(S, T, x) where 


(2) R(S, T, 2) = TPAGs,r 


such that P, = p, or d, according as S is or is not in §, Q, = p, or A, according 
as ST is or is not in §. 
The quantities a and x of the algebra 2 may be expressed uniquely in the form 


(3) a= jsas, z= 
for as and x7 in 3. Then 
(4) ax = ak, Rz = || R(S, T, x7) || 


is an m-rowed square matrix whose element in the S-row and ST7-column is 
given by (2). 

In a similar fashion (jrx)(jsa) = jrslgrs(xS-a)], or jrsl(~S-a)gr,s], or 
jrs\g7,s(a-xS)], or jrs[(a-xS)g7,s], where we use the product (7S) a or a (x8) ac- 
cording as T' is or is not in §, we multiply by gz,s on the left or right according 
as T'S is or is notin §. Then za — aL, where 


(5) L, =||L(S, 7,22) L(S, T, 2) = 

and P, = dor pz according as T is or is not in , Q, = Az or py according as T'S 
is or is not in §. 

_ It follows now that % is a division algebra if and only if every R, (or every Lz) 
with x ¥ 0 is nonsingular. 

3. Crossed extensions of commutative algebras 
If 3 is commutative every pz = i, and the distinction between right and left 

factors disappears. This eliminates the complexity in (2) and (5) as well as the 
role of § and we may write U = (3,G,g). Then L, (but not R,) may be taken 


to have elements in 3 and za — aL, where the element in the S-row and ST- 
column of L, is 


(6) 
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In the special case where 8 is associative as well as commutative the matrix 
|, has a determinant which we may call the norm of x and designate by N(z). 
Itisa quantity of 3.- When 3 is a field the algebra & is a division algebra if and 
only if N(x) 0 for every 0. 

We may now define a type of algebra which we shall call a quasicyclic algebra. 
We take 3 to be any commutative associative algebra of order n over §, and with 
sunity quantity e, and let © be a cyclic group of order n. Assume that the 
extension set g is such that jr = j' for every 7 = S* where S generates @ and 
thusj = js. Assume also that j" = ye for j ~ 0 in § and then have 


according as k + 4 < nork +i 2 n, where x,” = x,S*. The general quantity 
of isc = + + and 


(8) L, = ” ” ” ” ” 
1 1 1 (n-1 


We shall call n the construction degree of the quasicyclic algebra % and shall 


write % = (3, S,y). Such algebras exist for every 3 and y. Indeed a basis . 


&,***,@, Of 3 over may always be selected (in infinitely many ways if § is 
infinite) such that e = e, + --- + e,. Then the transformation S defined by 
mer + + —> + + + has order n and leaves e un- 
altered. . 


4. Quasiquaternion algebras 
A quasicyclic algebra of construction degree two is a linear space 


= (8, = 3 +53 
of order four over § such that y ~ 0 is in §, 


(9) (y + jz)(a + jb) = ay + ybz’ + jaz + by’) 


for all a, b, y, 2 of the algebra 3 of order two over §. We shall call all such 
algebras quasiquaternion algebras, and shall say that % is a split or a nonsplit 
algebra according as y = 6° or y ¥ & for 6 in §. 
Let us now determine all possible transformations a — a’ = aS defining 
quasiquaternion algebras. We observe first that any 3 of order two with a 
unity quantity is necessarily both a commutative and an associative algebra. 
Indeed 3 = ef + uw such that 


(10) ue = pu + oe (p, o in §). 
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When the characteristic of § is not two we may always choose u so as to take 
p = 0 and we shall do this. When the characteristic of § is two then p = 0 
or we may choose u so as to take p = 1. We shall thus assume that p = 0 or 1 


in this case. i 


The linear transformation 
(11) a= me + agu— aye + an(pe — u) 


and the identity transformation are the only automorphisms of 3 over §. Also 
(11) has order two unless § has characteristic two and p = 0. The quasiqua- 
ternion algebra % defined by taking S to be either the identity automorphism or 
the automorphism (11) is associative and it is well known that % is not associative 
for all other values of S. We seek to determine all such transformations. 

Since e’ = e we have wu’ = re + du, (u’)’ = re + O(re + du) = r(1 + det 
du =u. Then 


(12) r(1+8)=0, & =1. 
The transformation given by 6 = —1 is 
(13) =a, + au—a’ = ae + a(re — u). 


But & = 1 implies that 5 = —1 only when 6 = 1, the characteristic of § is not 


two, 27 = 0, 7 = 0, u’ = u, Sis the identity transformation. Hence S is given 
by (13) for 
(14) t #0, p. 


That 7 ¥ 0 follows from p = 0 when § has characteristic not two and from the 
fact that (13) is the identity transformation when 7 = 0 and § has character- 
istic two. : 


5. Elementary properties 
The algebra 3 = ${u] with unity quantity e is equal to §{i] for i = 7 u. 
Then 


for aand Bin § ~ 1. Here = O in case the characteristic of § is not two. 
With this generation of 3 the linear transformation a — a’ is uniquely defined by 


(16) a= me + ati me + ale — 4) 


for every a and ae of §. 

Lemma 1. Letabein 3. Thena’ = aif and only if a = ge; a’ = —aif and 
only if a = &; a’ = e — aif and only if a = 1 + &, wherev = e — Qi and éis 
in 

For a = a’ if and only if a: = a + a2, a2 = 0. Also. 

(17) a+a’ = (2a + ape 
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is ero if and only if a, = —2a,,a = o(e — 27). Finally a + a’ = e¢ is equiva- 
jent to 20n + a2 = 1, a2 = 1 — 2m,a = 17+ a(e — 22). 

The quantity v = e if the characteristic of § is two. Otherwise v(e + 27) = 
(1 - 4a)e and v has an inverse in 3 if and only if 


(18) 1 — 4a ~ 0. 
Define a linear transformation on 3 by 
(19) a = aye + at > a* = me + a(—e — 2). 


Then if (18) holds every @ of 3 determines a unique d = v ‘a and conversely 
a = vd is uniquely determined by d. We then have 

Lemma 2. Let d = va. Then d* = (v’)"a’. 

For a = dv = ( + det)(e — 21) = (6: — Beale + (52 — 25;)i, a’ = (6, — 
+ b — 2di)e — (52 — = — — Boa)e — (2 — 26,)i. Then v’d* = 
— 2i) (bie — — bet) = doe — dye — + (25, — 25, + as desired. 

We shall use these lemmas in the study of isotopy. Let us observe next that 
the mapping 


(20) r=yt+je>rA =y—je 
isan automorphism of %. This follows from the fact that 
(21) (y + jo2)(a + job) = ay + r*ybe! + jolaz + by’) 


for every jo = Xj where X ¥ Oin §. Then (21) with \ = —1 implies that A 
is an automorphism of Y%. When 1 — 4a ~ 0 the quantity v has an inverse 
(1 + 2i)(1 — 4a)~ and (vx)v* = v(av) = zA. Thus A may be called an 
inver automorphism in this case. 

The automorphism A is the identity automorphism if § has characteristic two. 
Write 


(22) iy = i + pe 

and have 

(23) ip = we + (0 — 1) =e — (we +i) =e — be, 
48 well as 


is = we + Bit ae = Biot (a + Bu — we. 
In particular i) = Be — i defines a mapping B given by 
(24) me + mi + + me + mio + + Saito 


such that [(me ++ ot)’1B = [(me + n2t)B]’, and B is an automorphism of & over 


§ characteristic two. 
We shall write % = (8, S, y) = (a, vy) when § has characteristic not two. 


By (21) we have 
(25) (a, = (a, 
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When the characteristic of § is not two we may always choose wu so as to take 
p = 0 and we shall do this. When the characteristic of § is two then p = 0 
or we may choose u so as to take p = 1. We shall thus assume that p = 0 or 1 


in this case. 
The linear transformation 
(11) a= me + au — ae + an(pe — u) 


and the identity transformation are the only automorphisms of 3 over §. Also 
(11) has order two unless § has characteristic two and p = 0. The quasiqua- 
ternion algebra % defined by taking S to be either the identity automorphism or 
the automorphism (11) is associative and it is well known that Y is not associative 
for all other values of S. We seek to determine all such transformations. 

Since e’ = e we have u’ = re + 6u, (w’)’ = re + d(re + du) = r(1 + det 
Su =u. Then 


(12) &=1 
The transformation given by 6 = —1 is 
(13) a=a+au—-a’ = ame + a(re — u). 


But & = 1 implies that 6 ~ —1 only when 6 = 1, the characteristic of is not 
two, 27 = 0, 7 = 0, uv’ = u, S is the identity transformation. Hence S is given 
by (13) for : 

(14) t #0, p. 


That + ¥ 0 follows from p = 0 when § has characteristic not two and from the 
fact that (13) is the identity transformation when 7 = 0 and § has character- 
istic 


5. Elementary properties 


The algebra 3 = §[u] with unity quantity e is equal to [i] for i = 7 wu. 
Then 


(15) Bit ae 


for aand Bin § andB #1. Here = in case the characteristic of is not two. 
With this generation of 3 the linear transformation a — a’ is uniquely defined by 


(16) a= me + at > me + ale — 2) 


for every a and a of §. 

Lemma 1. Letabein 3. Thena’ = aif and only if a = te; a’ = —aif and 
only if a = &; a’ = e — aif and only if a = 1 + &, where v = e — 2i and Eis 
in 

For a = a’ if and only if a: = a + a,a=0. Also. 


a +a’ = + ae 
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) take ;, gro if and only if a = —2m,a = a(¢ — 2%). Finally a + a’ = ¢ is equiva- ‘| 
lent to 201 + a2 = 1, a2 = 1 — 2a,a = 7+ — 
Jor] The quantity v = e if the characteristic of § is two. Otherwise v(e + 27) = . 
(1 - 4a)e and v has an inverse in 3 if and only if 4 
(18) : 1 — 4a ¥ 0. 
Define a linear transformation on 3 by 
(19) a = aye + > a* = me + — 2). 
qua- 
m or Then if (18) holds every a of 3 determines a unique d = va and conversely | 
ative q = vd is uniquely determined by d. We then have { 
Lemma 2. Let d = va. Then d* = (v') 
e + For a = dv = + — 2%) = (6, — + (62 — 2h)i, a’ = (6) — ! 
Yra + — — (52 — 26:)t = — — oa)e — (52 — 26,)2. Then v’d* = 
-(¢ — 2i)(be — — = dye — dye — + (25, — 262 + as desired. 
We shall use these lemmas in the study of isotopy. Let us observe next that 
the mapping 
(20) t=ytjerrA =y—je | 
aia isan automorphism of %. This follows from the fact that | 
ven Be (21) (y + joz)(a + job) = ay + d’ybz! + jo(az + by’) 
for every jo = Aj where A ¥ Oin §. Then (21) with \ = —1 implies that A 
is an automorphism of %. When 1 — 4a ~ 0 the quantity v has an inverse 
the (1 + 2i)(1 — 4a) and = v(av") = zA. Thus A may be called an 
ay inver automorphism in this case. 
The automorphism A is the identity automorphism if § has characteristic two. 
Write i 
(22) ip = i + pe 
and have 
(23) io = we + — 1) =e — (ue +i) =e — 
4s well as 
vO. 
by is = we + Bi + ae = Bio + (a + Bu — w De. 
In particular ip = Be — 7 defines a mapping B given by 
(24) me + mi + + > me + mio + + Saito 
nd such that [(me ++ 2t)’]B = [(me + n2t)B]’, and B is an automorphism of & over 
§ characteristic two. 
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We shall write % = (3, S, y) = (a, vy) when § has characteristic not two. 
By (21) we have 


(25) (a, 7) (a, \’y). 
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’ When § has characteristic two we shall write &% = (a, 8, y) and have 


(26) (a, B, = (@ + Be - B, r’y) 


as a consequence of (24) and (23) if 8 = 0. If 6 = 0 then ip = ye + i, ij = 
(a + ue, jo = dj yields (26) for 6 = 0. 


6. The norm and trace functions 
The general quantity of W is given by 
t=yt+je, yrometm, 
for m, m, ti and fin §. Theny and z are in 3 and we define 
(27) = yy’ — ve’. 


We compute x’ by the use of (19) to obtain 2” = y? + yzz’ + jely + y’) = 
y + yee’ + — ty’). Then — T(x)x = — yy’, and 


(28) T(x)x — N(z). 
By (17) 
(29) T(x) = 2m + m. 


Also yy’ = (me + mi)(me + me — mi) = (ni + mme + nx — nit. Using 
(15) we have 


(30) N(x) = Ni(zje + N2(z)i, 

where 

(31) Ni(x) = 11 + mm — ma — — $a), 
(32) N(x) = (1 — 6)(n2 — $2). 


The trace function is in §[e] for every x and the norm function NV (2) i is in 3. 
Then N(x) is a scalar multiple of e if and only if 7; — yf? = 0. Now2’ — T(x)z 
is in the algebra §[z] generated by x and this algebra contains N(x). If N2(x) #0 
the algebra §[7] is contained in §[z]. When $[z] is quadratic we have §{i] = 
$[z]. This proves 

Fee 3. Let §[x] be a quadratic subalgebra of A where x is not in 3. Then 
N: 2 x) = 0. 

We have called a quasiquaternion algebra a split or nonsplit algebra according 
as y is or is not the Square of an element of §. By (30) the quantity N(z) is in 
if and only if 7; — = 0. Thenif nonsplit algebra we have = 
vt only if m = f2 = 0, vis in S[j]. We have proved 

Lemma 4. If & is nonsplit and x is not in 3B the quantity N(x) is in § if and 
only if x is in §[j]. 

As a consequence of Lemmas 3 and 4 we have 

THEOREM 1. The only quadratic subalgebras of a nonsplit quasiquaternion 
algebra are its subalgebras 3 and §[j]. 
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7. The left multiplications of 2% 


The left multiplication L, of a quantity x = y + jz of a quasiquaternion 
igebra 9 is a linear transformation on % which has been exhibited* as 


63) 


No difficulty arises if we regard the elements of L, as quantities of 3 when we 
are studying a single algebra. However when we study the isotopism of quasi- 
quaternion algebras we will be forced to consider them as linear spaces of order 
four over and to replace e by the two-rowed identity matrix, 7 by the two 


rowed matrix 


in the quantities y, y’, z, 2’ which appear in (33). 

The multiplication 
(35) L;= 
has the property 

But ja = j(y + jz) = ye + jy, aj = (y + = 
(7) = 123, Leb; = Lie. 
It follows that 
(38) (ja)e = j(xe), (aj)e = x(Je) 


for every c and x of &%. Thus % has a partial associative law. 
We also compute the sum of the matrices of (36) to obtain . 


(39) + = 
where 
(40) yT (ze + T(y)j, 


= + Ty) = 2m + mare in §. Then q is in and L, is in 
i} 
Tet us now suppose that c = a + jb is a quantity of % such that L, is non- 


‘Note that this is formally the same as the representation of an associative quaternion 
lgebra which has been presented in many places. However z — 2’ is an automorphism 
in the associative case and is not an automorphism here. 
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Then 
(41) N(c) = aa’ — bb’ 
is a nonsingular quantity of 3. Write d = [N(c)]~ and have 
a 

(42) Le da ), 

(43) L c + Le L; ( ya'd + ( vad 
It follows that + = oi + for o; and in § if and only if 
(44) —y(b+ =aN(c), = oN(c). 


However by (17) we see that either o, = o2 = 0 or N(c) isin §. By Lemma3 
we see that if 2 is nonsplit either c is in §[j] orb + b’ = a+a’=0. Applying 
Lemma 1 we have a = tv, b — mv, c = fu + njv = v(ge — nj) = vg. We have 
proved 

THEOREM 2. Let % be a nonsplit quasiquaternion algebra and c be a left non- 
singular quantity of X such that L;L>* + L;z'L; is in [Lj]. Then c is either in 
SLi] or has the form c = vg where g is in Sj], 


& wqw\ —ne\/v O\ _ 


If c = me + mj for m and m in § and not both zero then L, is nonsingular. 
By (37) we have Lb, = (mI + mbjLe = Lee, Le = Le'Lee. The linear 
space L(%) of all left multiplications of % has order four and so does L;'L(%). 
But contains L(%), = L(A). 

Suppose next that c = vg where g is in §{j].. Then if v is a nonsingular quan- 
tity of 3 we have = (L7'L,)* = LS = L(A). But by 
Lemma 2 we have 


—1 
0 y 2\ % 


where y = vy, 2 = v%. Thus L7'L, is the left multiplication of the general 
quantity of the algebra % with a basis 1, i, j, ji such that 7” = Bi + ae,j’ = v6, 
ij = j(—e — i). This algebra is % if § has characteristic two. Otherwise it 
is isomorphic to % since in %> we have i} = ae, inj = j(e — %) if 9 = —i. Then 
the linear transformation replacing i by —i replaces L;>'L, by PL,,P™' and we 
have proved 

TuEorEM 3. Let % be a nonsplit algebra, c be a left nonsingular quantity of % 
such that L;L;' + is in Then = for a non- 
singular linear transformation P. 


8. Quadratic subalgebras 


The results we shall obtain on the isomorphisms of quasiquaternion algebras 
will be consequences of a determination of their quadratic subalgebras. We 
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have already made this determination for nonsplit algebras in Theorem 1. 
Hence we may assume that 2 is a split algebra. By (25) and (26) we may take 
y=l Let Ble] be a quadratic subalgebra of % and §[z] ¥ 3. By Lemma 3 
we have m= £2, t2 = em where ¢ = tl Then N(x) = Ni(z) = (ni + mm — 
{1 — im + = [mi — + (m — and thus 


(46) N(x) = (m — + + moe. 

We study first the case where T(x) = 2m + m = 0. Then m+ m = —m and 
(47) = —N(x) = (m — ei). 

If m = ef: then {1 = em, m = —2m, {2 = —2em and 2 = 0. We have 
proved 


TueorEM 4. A quantity x in A = (3, S, 1) which is not in 3 has the property 
that x” = 0 if and only if x = mw or n(wA) where n ¥ 0 is in § and 


(48) w= (e + jv, wA = ( — jy. 


Suppose next that 2” ¥ 0 but T(z) = Then by (47) we may normalize 
m + 1, = em, m = —2m,2= 
no + j or m(wA) + 7. 

TazorEM 5. The quantities 


(49) t= mw+j, wd) + j 


of X = (3, S, 1) defined for » in § and w of (45) have the property that x* = e. 
They are the only quantities of A with this property. 

A quadratic algebra [x] of characteristic not two has a generation with 

T(z) = 0. Thus we have 

TazorEM 6. Let % = (a, 1) over § of characieristic not two. Then the quad- 
ratic subalgebras of A are 3, Flwl, §{wA] and the algebras §[x] such that r=e 
defined by (49). 

If the characteristic of § is two the quantity v = 1 and (48), (49) imply that 
the only quadratic subalgebra §[2] such that 7(x) = 0 and §[z] ¥ 3 is SL]. 
There remain the subalgebras [x] with T(x) ~ 0. Then we may assume that 
Mz) = 1 = 2m + m= m,N(e) = (mt at = (nit 
Puta = 2 — (m+ and see that 


Siu] = Fel, = — (i + = (mt hide = 


Hence we may assume that N(x) = 0, and thus that m + f: = 0 or 
that + ¢; = 1. In the former case m = = m = landz = m(e+j) + 
i+ ji. In the latter case x = e + m(e + j) + i + ji which generates the 
same algebra as does m(e + j) + i + ji. We have proved 

Tuzorem 7. Let % be a split quasiquaternion algebra over § of characteristic 
two. Then the only quadratic subalgebras of % other than 3 and §{j] are the al- 
gebras $x] such that x” = x where x is defined by 
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(50) c= ii, 


for n in §. 

Note that if & = (a, 0, 1) there are two subalgebras of %& containing quanti- 
ties which are the roots of inseparable quadratic equations, but if & = (a, B, 1) 
with 6 ~ 0 there is only one such algebra. Hence no (a, 0, 1) is isomorphic 
to an (a, B, 1) with 6B ¥ 0. 


9. Generations of quasiquaternion algebras 


We seek all generations A = 3o + jo3o of a quasiquaternion algebra Y = 
(3, S, vy) such that 


(51) Bo = = Bolo tone, jo= ye + tajo = jo 
for ao, Bo, Yo in §. Applying (40) for an arbitrary generation we have 


(52) jox + ajo = Tu(x)e + ; 
where 7'(x) and T(x) are in §. It follows that 
Sliol ¥ 3. 


For otherwise jo = we + di where \ ¥ 0 and are in joj + = Qj + Hj 
is not in 3, a contradiction. 

If % is nonsplit its only quadratic subalgebras are 3 and §[j] and if $[jo] ~ 3 
then §{jo] = $7], 30 = 3. When the characteristic of § is not two then jo = dj 
for \ ~ 0 in § and the result of this generation &% = 3 + (Aj)3 is given by 
(a, y) & (a, \*y). If the characteristic of § is two then $[jo] = F[j] for any 
jo = we + Ajanddr #0, §. Then % = ae + where a and are in §, 
Joto + taJo = (ueto + tome) + + (are)AJ + + = andj = we + dj 
if and only if » = 0, jo = Xj, a2 = 1. The result of this generation is given by 
(26). 

Assume next that 2 is a split algebra over § of characteristic two. By the 
argument above the only generation with §[jo] = §[j] and Bo = 3 is that re- 
sulting in (26), (27). By (53) and Theorem 7 we necessarily have §{j| = 
OL], jo = we + Aj for ¥ Oandywin When ¥ 3B we have Jo = with 
x given by (50). Then = + ave, + joto = Aar(aj + jx) = Aai(jt + 
aj + +e- = Aau(e + j). Then dojo + jJoto = jo if and only if (e + j) = 
we + dj, = 1,4 = 4, jo = +9). But then jo = 0 contrary to our hypothe- 
sis that yo ~ 0. Indeed jx = 0 and A ¥ Fx] + joH[z]. We have proved that 
there are no other generations other than that given by A = 3 + (dj)3. 

Our final case is that given by &% = (a, 1) over § of characteristic not two. 
Since [jo] ~ 3 we may use Theorem 6 to obtain §{jo] = [x] where z = mw + 
enjv + 6j such that & = ¢ = 1. Moreover we may assume that je = e and so 
jo = x. But jv + vj = O and (jv)v + v(jv) = 0, 2v + vr = nv = ne — 2i)° 
n(e + 4ae — 4%). This is not in §{2] unless 7 = 0,2 = +7. We have proved 
our main 
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Tasorem 8. The only generations (51) of a quasiquaternion algebra (3, S, y) 
are those in which 3o = 8, jo = Aj for d ¥ Vin §. 


10. Automorphisms and isomorphism 


An isomorphism 7' of a quasiquaternion algebra % = (30, So, yo) on a quasi- 
quaternion algebra % = (8, S, y) carries 30 into a quadratic subalgebra 397’ 
of and jo into a quantity jo7’ of such that = + (jo7')(BoT) and 
= yee, BoT = + GoT)io = jo. By Theorem 8 = 3, 
T induces an isomorphism of 3 and Also jo = Aj O in F. By the 
proof of Theorem 8 we have 

TazorEM 9. T'wo quasiquaternion algebras (a, y) and (ao , Yo) over of char- 
acteristic not two are isomorphic if and only if op = a, yo = d’y for ¥ 0 in F. 
Two algebras (a, 8, y) and (a , Bo , Yo) over § of characteristic two are isomorphic if 
and only if By = B, = a+ p + = d’y ford O and in §. 

An automorphism of (a, y) over § of characteristic not two is an isomorphism 
which replaces 7 by itself and j by \, such that \*y = y,\” = 1. ThendA = +1 
and we have proved . 

TurorEM 10. The automorphism group of (a, y) over § of characteristic not two 
is the group (I, A) of order two defined by (20). ‘ 

We also see that an automorphism of (a, 0, y) over § of characteristic two re- 
places i by i) = 7 + pe, j by Aj such that \*°y = 7, ” = 1, = 1, (Gi + pe) = 
If = (a, 8,7) an automorphism replaces i by i) = 
i+ we where = Bip + we, jo = such that = y,A = 1. Then % = fe — 
ior7. We have proved 

ToeoreM 11. Let § be a field of characteristic two. Then the automorphism 
group of (a, 0, y) is the identity group, the automorphism group of (a, B, y) for 
wery 8 * 0 is the group I, B of order two given by B defined in (24). 

It is a curious fact that while the antiisomorphs of quasiquaternion algebras 
are algebras of a similar type they are not quasiquaternion algebras. For let 
% = (30, So, vo) be antiisomorphic to % = (3, S,y) under a mapping 2 — 27’ 
on % to YX such that (aoco)7’ = (co7")(xoT’). Then 307’ = 3: is a quadratic sub- 
algebra of isomorphic to Bo, 3: + where = joT, ZB: = = 
Then ji: + af = ji, ij: = — ti). It follows from Theorem 6 
that 3: = 3, 29 — zo7' is an isomorphism of 3o and 3, j: = Aj for A ¥ 0 in §. 
But (joz0)yo = jo(zoyo) for every yo and 2 of Bo, [(jozo)yo]T = = 
wap) = [(zoyo)T = However = = 
Nilews)’, = and (z:y1)’ = contrary to our assumption that 
Sis not an automorphism of 3. 


11. Division algebras 


The algebras 3 and $[j] are commutative and associative algebras and if either 
contains a divisor of zero so does %. Hence is a division algebra only if 3 
and §[j] are fields. Conversely let 3 and §[j] be fields, 7° = ye. Then y ¥ 5 
for any of §, N(x) = yy’ — zz’ = 0 only if (19) holds. Then y and z are in 
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y’ = =2,y' — yz =0,thatise =0. This proves that is a division 
algebra. We have proved 

THEOREM 12. A quasiquaternion algebra is a division algebra if and only if its 
two quadratic subalgebras are fields. 

For the algebras of Theorem 10 and the algebras (a, 0, y) = (a, y) this theorem 
states that (a, y) is a division algebra if and only if neither @ nor y is the square 
of a quantity of §.' For those of Theorem 11 with 6 ~ 0 it states that (a, 8, y) 
is a division algebra if and only if y is not the square of any quantity of § and 
a ~ — Bu for any of §. 

A scalar extension field 8 of § will be called an intersecting field for a division 
algebra Y& over § if Ae is not a division algebra. From Theorem 12 we have 


THEOREM 13. A scalar extension 8 of § is an intersecting field of a quasiqua- § 


ternion division algebra % over § if and only if KR contains a subfield over § iso- 
morphic to one of the two quadratic subalgebras of %. 

If § is a finite field of characteristic two every quantity of § is a square. 
Thus there are no quasiquaternion division algebras, and indeed no nonsplit 
algebras, over such a field. A finite field § of characteristic p ~ 2 contains 
q = p elements and there is only one quadratic extension of §. It is the field 
of q elements and § contains a quantity y which is not a square in § such that 
if 7’ = ye then Ij] isa field. Necessarily a = ry, 7’ = ae and there are precisely 
3(q — 1) distinct nonzero values of 7°. By Theorem 9 two algebras defined for 
distinct values of a are nonisomorphic and we have proved 

THEOREM 14. Let § be a finite field of q elements where q is odd. Then there 
exist 3(q — 1) quasiquaternion division algebras over § such that no two of these 
algebras are isomorphic. 


12. Isotopy of division algebras 


Let x be the general quantity of any algebra %, L. be the left multiplication 
of x as a quantity of Y. Then the left multiplications Lo of any isotope > 
with a unity quantity of % are given® by 


LS 


where P and Q are fixed nonsingular linear transformations on Y, c is a fixed left 
nonsingular quantity of Y. If % is a quasiquaternion algebra and we take z = 
je we use (37) to write = = L;. Then = PL;P™ and 
we have 

Lemma 5. Every isotope with a unity quantity of a quasiquaternion algebra U = 
(3, S, y) contains a quantity jo such that (L)? = yI. 

We also have 

Lemma 6. Let &% = (3, S, 7) be a nonsplit quasiquaternion algebra. Then the 
only quantities x of X such that L2 is a scalar transformation are the quantities x in 
with this property. 


* For the definition of an isotope and the fundamental properties see my Nonassociative 
algebras I. Fundamental concepts and isotopy, these Annals vol. 43 (1942), pp. 685-707. 
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For if is a scalar then eL; = = ée. If x is not in then is a 
quadratic subalgebra of & and, by Theorem 1, §[z] = 3. But if z is in 3 then 


xz 0 0 
0 2] 0 


only if 2” = = 6e. = x” if and only ifz’ = + x. In the former 
case’ = x is in §[e] and hence in §{j]. In the latter case § has characteristic 
not two and 2’ = mv = n(1 — 22), e=n(l—4i+ 4a) de. 

As a consequence of the proofs of our lemmas we have 

TaeorEM 15. Every quasiquaternion isotope %o = (30, So, Yo) = 3o + JjoBo 
of a nonsplit quasiquaternion algebra A = (3, S, vy) = 3 + j3 is nonsplit and 
= such that (Ls))* = 

For by the proof of Lemma 5 we have (L{)’ = yI where 

ji = 

Then #{j:] is a field. By Theorems 6 and 7 the only subfield of a split quasi- 
quaternion algebra is 3 and, by the proof of Lemma 6, no z in 3» and not in 
fle] has the property that (L{”)? = yZ. Hence % cannot be a split algebra, 


is a quadratic subfield of the quadratic field = 
We now observe that if the characteristic of § is not two then necessarily 


jh = Ajo 
where 40 is in §. By (89) we have LOL! + LOLO = ml + mL$! for 
every 2 in %> where m and 2 arein §. If § has characteristic two this result also 
holds since jr = + Ajo and + = + Ly, 
But then, multiplying by P™ on the left and P on the right, we have 


+ = (ml + 


for m and min §. Take « = c to obtain the result used in (43) to imply Theorem 
3. We have proved 

TuroreM 16. Let & and %, be isotopic nonsplit quasiquaternion algebras. 
Then there exist nonsingular transformations P and Q on % such that 


LY = 


for every x of A. 

We next prove 

TozoreM 17. Jf = (8, S, vy) and = (Bo, are isotopic quasiqua- 
lernion division algebras the fields 3 and 3» are isomorphic. 

For we may construct scalar extensions ® and & isomorphic over § respec- 
tively to 3 and 35. The algebra %g contains a quantity x such that L, is singu- 
lar, PL,P™ is singular, (W%)¢ is not a division algebra. Similarly %g, is not a 
division algebra. By Theorem 13 the field & is isomorphic either to 30 or to 
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Sljol, the field is isomorphic either to 3 or to Since Fly] and are 
isomorphic these isomorphisms imply in all cases that 3 is isomorphic to 3. 

If § has characteristic two Theorem 15 implies only that 7; = doe. + A jo for 
do and X ¥ O in §. However if the characteristic of § is not two the property 
ji = ye implies that j, = jo. Since we shall not complete the study of isotopy 
of quasiquaternion algebras here we shall restrict our attention to the case of 
division algebras over § of characteristic not two. Then & = (a, y), % = (a, 
yo) such that 7; = (Ajo)” = A*yoe = ve. Then by (25) % = (a0, 7) and we may 
assume that yo = 7. 

We may now use the basis e, 7, j, jt of Y& over § as a basis of %. Then multi- 
plication in % is defined by c= cL such that j-7 = ji, 7-74 = ace, j-j = ye, 
It follows that 


LY =L;, LP = 
0 e-% 


_ By Theorem 17 we have a = ya and we may take 


to = pt 
for Oin §. 
We now observe that = y so that\ = + = + j, PL;P™ = 
=+1. Write 


P, P, P, P; P, 
P= , PL;= L,P = 
Ps yP, P. yP, 


It follows that Py = 44> eP. 45 P; = 3> eyP. 2 and thus that 


P, P, 
eyP2 eP. 


Suppose now that L{” = PL,P~ where c = a + jb. By (39) ‘age in % we 
have + LOLY = LO thus L.L; + = L;. It follows 
from (40) that T(b) = 0, T(a) = 1. By Lemma 1 


b=m, a=i+h, (3, 


We now compute PL, and L{P to obtain the relations 
(53) Pia — yPsb = wiP,, P,b — — a) = 


as well as two other, but equivalent, relations. Then yP2bPia — uiP;,yP.b° + 
(Pia — piP;)(e — a) = wi(Pia — wiP,). Then Pifyb? + ale — a)] = wiPi 
poaP,. It follows that 
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(64) wiP, = Py, y = yb + ale — a) + wae, 
so that y isin 8. Also by (53) 
(55) Pi(a — y) = Prob. 


Ifb = 0 then Pya = iP, and Pia’ = (ui)’P, = Pi(ave), Pi(a” — ave) = 0. If 
P, ¥ 0 then our hypothesis that 3 is a field implies that a” = aye, a = + ui. 
However a’ = +yu(e t), a+ = = Wo = (a, is isomor- 
phic to A. If Pi = O then ¥ 0, Pra’ = (—yi)P2, P.(a’)* = (—pi)*P2 = Py 
(aye), a” = aoe, a’ = + yt, (a’)’ = + i) =e—a',a’ +a= + pe = 
y= + 1 and again & and % are isomorphic. 

Hence let b ¥ 0. Then 


(56) P,=Pyz, z= (a — yb)", 


P; 0 
(57) P= 
O P,i/\evz 


is nonsingular if and only if P; is nonsingular and e — yz ¥ 0. However e — 
=e —y(a — y) yb") 0 if and only if 


(58) (a — y) # yb’. 


Since P; is nonsingular the quantity y = Pz'uiP; has the property that y* = 
vae. But y isin 3 and thus y = + ui. Since only ,’ is determined by % we 
may assume the notation yu selected so that y = ui. We compute y = yn’ 
(1+ 4a — + + — 2i) — [ae + — + — + dae] + 
ave and see that y = yi if and only if 


(9) p= 4 +1, (14 +77) = a(l — 


Then 1 — p = 4(€ — & + yn’), (1 + 4a)(1 — = 4a(1 — uw’), (1 — u)(1 — 
= 0 and u = 1 or (4a). Note that the first equation of (59) is equivalent to 


(60) (1 — — = 


If 1 = (4a) in (60) there exist quantities {1 = (1 — 2¢)2a, f2 = 4na such 
that a = ¢f — yf}, 7¢3 = ¢i — a,7 is the norm of a quantity of the field 8. We 
have proved 

18. Let = (a, 7) = (3, S, y) be a quasiquaternion division algebra 
over § of characteristic not two such that y is not the norm of any quantity of 3. 
Then a quasiquaternion algebra is isotopic to U if and only if it is isomorphic to Y. 

Suppose next that y is a norm and so a = {i — {7 for {: and & in §. Then 
fand » may be determined so that ¢: = (1 — 2£)2a, {2 = 4na and (60) is satisfied 
for» = (4a)"1, Then (59) is satisfied and there exist quantities a and b in 3 
such that un; = yb? + a —a’+ wae. If (a — pi)” = yb’ then a’ — + u2ae 
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= (2ui — ea, 
a=ypt. Buta =i+ &e — 2) anda = wi if and only if = 0, w» = 1 a contra- 
diction. Hence the transformation 


= pita’ —a— wae, — pi = (Qui — e)a, (Qui — 


defined by z = (a — ui)(yb)~ is nonsingular and 


e-—-wm 


PL ‘a(t ) = = L;. 


Then PL(M)P™ contains PL,P™ for every x = ame + ae, and thus contains 
Le 40,1. Since L; = LL; we have 


in PL(WP", = L(%). We have proved 

TuHEeorEM 19. Let % = (a, vy) = (8, S, vy) be a quasiquaternion division algebra 
over § of characteristic not two and y be the norm of a quantity of 3. Then a quasi- 
quaternion algebra is isotopic to A if and only if it is either isomorphic to A or to 
, y) where a = (16a). 

We shall conclude our study with a discussion of the real case. By Theorem 
12 every quasiquaternion division algebra over the field of all real numbers is 
an algebra & = (—y", —1), where uw is any positive real number. By Theorem 
18 two such algebras defined for distinct values of u are not merely nonisomorphic 
but are actually nonisotopic. 
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ON THE DECOMPOSITION OF MODULAR TENSORS (11) 


By R. M. 
(Received October 14, 1943) 


Introduction 


The present paper is the second part of [8] (brackets refer to bibliography) 
referred to below as Part I. We number sections, formulae, theorems, etc. 
consecutively from those in Part I, and use the same notation. For instance, 
tis a field of characteristic p with ¢ (< ) elements; &, is a f-vector space of 
dimension n; @ = @(n, f) is the full linear (modular) group of all f-automorph- 
isms of B ; Bm is the space of all tensors of rank m over B; ; Im is the Kronecker 
m” power representation of G; %,, is the enveloping algebra of Il, . Super- 
sripts zero indicate the analogous quantities defined over a field f° of character- 
istic zero. 

The decomposition of non-modular tensors, or equivalently the determination 
of the reduced form of 11%, is now a classical part of algebra. The objective of the 
present sequence of papers is to obtain a similar theory for modular tensors, or 
equivalently to study the reduced form of II, . Any representation of G whose 
space is a subspace or factor space of %,, , or a direct sum of such spaces we call a 
lensor representation of G. If q (and therefore ) is finite we denote the group 
ring by T. One of the main results of the present paper is that there exists a 
faithful tensor representation of T (Th. X §14). From this it follows from an 
unpublished theorem of Nesbitt that every representation of @ is equivalent 
toa tensor representation, but we do not establish or apply this last result below. 

An important feature of the study of modular representations of groups has 
been the use of “induced representations”. One starts with a finite group and 
anon-modular representation of it, and after suitable number theoretic prepara- 
tions take residue classes and obtain a moduiar representation. A generaliza- 
tion of this process would be to induce both the group and the representation. 
We leave for future investigation the determination of the general theory of 
such a process, and content ourselves here with the application of the idea to ob- 
tain from each irreducible representation of the non-modular full linear group a 
representation of the modular full linear group. This is done in §9 below. In 
§10 a character theory is developed for tensor representations of @; this is applied 
in §11 to prove that every irreducible representation of @ is equivalent toa 
tensor representation,’ and in §§12, 13 to obtain specific values of the irreducible 
and indecomposable modular characters for the representations II,, having 
m < 2p. 

In Part I the reduced form of II, for m < 2p was determined subject to the 
assumption g = m. In §§15-20 the situation for m < 2p is cleaned up by re- 


‘Since submitting this paper the author has learned that this result was also obtained 
by Carson Mark in his Toronto thesis (1938, as yet unpublished). 
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moving this assumption and finding what further reductions can be made when 
q = p. A main tool in this investigation is the character theory of §13. The 
most troublesome case is form = 2p — 1,n = 2. The difficulty (and also special 
interest) of this case is due to the influence of zero divisors in the ring of functions 
over a finite field. This case is the only one for m < 2p where these zero divisors 
need be considered; but for larger values of m they will enter much of the time. 


§9. Induced tensor representations 


In the present section we show how to construct a modular tensor representa- 
tion from each ordinary irreducible tensor representation. (We use the term 
“tensor representation’’ to refer to a representation of the full linear group whose 
space is a subspace or factor space of the space of all tensors of some rank, or 
for a direct sum of such representations). We follow the notation of Part I for 
modular representations; i.e. %, is a space (of dimension n) over a field f of 
characteristic p. BW, (the space of all tensors of rank m) is representation space 
for the Kronecker m™ power representation A —-IIn(A) =AX---xXA 
(m factors) of the full linear group G@(= G(n, f)) of all non-singular linear trans- 
formations A of %; onto itself. Let £ be a field of characteristic zero. We shall 
use superscripts zero to indicate analogous groups, spaces, and representations 
over f°. Thus @’ is the full linear group on a f°-vector space B} (of dimension n), 
etc. In what follows we shall frequently choose for f° a field such that f is residue 
class ring of the ring of integers in f° with respect to some prime ideal. When 
this is the case we shall say that f° is a field which induces f. 

An irreducible constituent of II}, is characterized [10. P. 124, Th. 4.4D] by a 
partition of m. We write $°(A): A° — H°(AXA°) for the irreducible constituent 
of associated with the partition (A): + = 
=r> 0. 

Lemma VII. With proper choice of basis vectors in the representation space, 
H(A), of S°(A), we can find polynomials h°,,(X$,) in n? indeterminants X%, and 
with rational integer coefficients such that the matrix A° = || ax || is represented by 
the matrix H°(AXA°) = || 

Proor: We consider the f’-group ring R°(= %°(m, f°)) of the symmetric group 
Gm. In Part I [8. P. 675] we showed how to interpret the elements, s, of Gn 
as linear operators on the tensor space %),, and we now define a = )_a(s)s, 
(a(s) in f°) as an operator on &, by linearity; i.e. aX° = }\a(s)sX° for all vectors 
of Bi, and all elements a of 

We start with the classical theorem that H°(\) has the form e®?, where ¢ is a 
primitive idempotent of ®° [10. P. 129, Th. 4.4D]. We also use the fact that 
e (as an element of %°) is commutative with each operator II?,(A°). 

We consider the natural basis for the tensor space &°, (i.e. the one relative to 
which the effect of A° is given by the Kronecker m™ power of A°). Let x(i:, °°: 5 
im) denote the particular basis tensor which has its 7; , --- , im component unity 
and all other components zero, and set ex(i;, , im) = tm). Then 
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is the set of all f-combinations of the 2’(i,,---, im). Since I,(A°)e = 
we have from 
that 


Let 3° denote the ring of rational integers. . The set of all 3°-combinations 
of the z'(i:, , tm) is an W°, contained in H°(A). It is a classical 
property of 3°-spaces (lattices) that W° has an 9°-basis which is also a f-basis 
for H(A) [9. P. 142]. (By “basis” we mean a linearly independent set of vectors 
which span a space.) Now our lemma follows from formula (52) if we express 
the 2’(i:, -** , 4m) and the vectors of the given 3°-basis in terms of each other. 

let 3, denote the field of integers mod p, and let Xx be indeterminants over 
3,. Denote by huo(Xix) the polynomial obtained from hi»(Xix) by replacing 
each coefficient by its residue class mod p and X% by Xu. Then we have: 

lemma VIII. Let A = || aix|| be any element of (the full modular linear group) 
G, and set H(XXA) = || Auv(aix) ||. Then the correspondence $(A): A — H(AXA) 
no representation of @G, which is referred to as the representation induced by 
§ (A). 

Proor: Let A° = || a || and B® = || bi || be matrices whose elements are 
indeterminants over 3° and set A°B° = i.e. ch, = Since $°(d) is a 
representation of @° we have 


Let bi be indeterminants over the field $§,. The mapping as —> Gn, 
bi: > bx , integer — its residue class mod 7, is a homomorphism of the ring 
$'laix , onto bi]. Under this homomorphism (53) is replaced by 


where cx = >> aiyb,x. If we identify 3%, with the prime field of f and specialize 
the ai, and bj, into elements of f (such specialization is a homomorphism) then 
(54) is seen to be the condition that (A) should be a representation of G. 
Observe that the representation (A) just constructed depends upon the 
particular idempotent e and after that upon the particular basis chosen for the 
S'space W°. We do not contend that two different primitive idempotents ¢ 
(belonging to the same (A)) need give rise to equivalent 6(A), although it is evi- 
dent that two different choices of bases for W° will give rise to equivalent (A). 
What we can expect is that all the S(A) (for a given (A)) will have the same 
ireducible constituents and this can be proved by arguments strictly analogous 
to those used by R. Brauer for finite groups [2. P. 954]. We omit the proof. 
Although $°(d) is a tensor representation it is not evident that §(A) is. The 
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root of the difficulty is that the idempotent e = Ye(s)s will not in general have 
any image in Rt (because the e(s) need not be integral at the prime spots divid- 
ing p). We do not investigate the general problem of determining those induced 
representations which are tensor representations; the following lemma is suff- 
cient for our needs. 

Lemma IX. For each partition (d) there is at least one induced representation 
H(A) which is a tensor representation. 

Proor: For any (A) we can find a primitive idempotent e = > e(s)s where 
each coefficient is one of the numbers 1/h, 0, or —1/h, h a rational integer 
(10. P. 125, Th. 4.3F]. In the proof of Lemma VII replace 2’(i;, --+ , in) by 
im) = ha’ (ir, im). The %°-combinations of the x’’(i;, , in) 
constitute an 3°-subspace Wi of W°, and every vector of is in the 3°-space 
generated by the basis tensors he = >che(s)s has coefficients 


' 1, 0, or —1 and so has a non-zero image p in the modular group ring . The 


homomorphism argument used to establish Lemma VIII can now be used to 
show that p%,, is representation space for the induced representation H(A). We 
omit the details. 


§10. Characters 


Following Brauer and Nesbitt [1. P. 559] we use complex numbers as char- 
acters for modular representations. For the ensuing argument we temporarily 
restrict ourselves to a finite field f so that G will be a finite group. However, 
the results obtained are easily extended to infinite fields. Suppose that the 
order of G is g = p’g’ where (g’, p) = 1. Let f be the splitting field over f for 
the equation x” = 1, and take for f° a field which induces f’. Let ¢ be a primi- 
tive g’™ root of unity in f’ and let w be a complex number primitive g’™ root of 
unity. (We may suppose that w is in f° and that « is the image in f’ of w.) 

Let T: A — T(A) be any modular representation of G. The eigen-values of 
T(A) will all be powers of ¢ and if A is p-regular (i.e. of order prime to p) T(A) 
is similar to a diagonal matrix of the form 


We call xz(A) = w™' + w™ + --- the character of the representation T. Brauer 
and Nesbitt have proved that a necessary and sufficient condition for two modu- 
lar representations T and T’ to have the same irreducible constituents is that 
xr(A) = xr-(A) for all p-regular A in © [1. P. 560]. 

Our program is to calculate the character xq)(A) of the representation 5(), 
and then use it to get information about the irreducible and indecomposable 
tensor characters of G. We may consider §(A) as being a representation of the 
group &’ = G(n, f’). Any element A of G’ can be regarded as the residue class 
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image of some matrix A° in the group G’ = G(n, f). If two elements Ai, As 
of @ are similar the same is true of T(A,), T(A2) for any representation T of G’ 
(for similarity in the full linear group is demonstrated by inner isomorphisms). 
Since every p-regular element of @ is similar to a diagonal element of G’ we can 
compute the character of (A), considered as a representation of G, provided 
we know the value of xq)(A) for the similar diagonal f’-matrices A. 

Suppose A = diag(a:, a@,---) where a; = e**. Then set Ao = diag(a} , 
a;,:**) Where a, = w%*, The homomorphism of the ring of integers in f° 
onto t’ sends A° into A, by Lemma VII sends H*(AXA°) into H(A{A), and so 
sends the eigenvalues of H°(AXA°) into the eigenvalues of H(AXA). The eigen- 
values of H°(A¥A°) are known to be power products of the a} and so those of 
H(\XA) will be the same power products of the a;. Therefore, the mapping 
— w* sends the eigenvalues of H(A{A) back into those of H°(A¥A°) and so 
we have [3. P. 84; 10. P. 203]. 


(55) xa(A) = {A(A°)} = 


In other words the character of (A) (for G) is the well known Schur function 
of the complex number counterimages of the modular eigenvalues. We set 
xa(A) = {\(A)}, and when the argument A is unimportant we drop it leaving 
just. {A}. 

In Part I it was proved that for a given tensor representation (such as the 
§(\) of Lemma IX) the decomposition into irreducible and indecomposable 
constituents was independent of the field f, provided merely that f was not too 
small [8. P. 675, Th. III]. Hence formula (55) can be used in determining the 
irreducible constituents even though the field f is infinite. 


§11. Irreducible representations 


TarorEM VIII. Jf £ is any finite field then every irreducible (modular) repre- 
sentation of the full linear group @ = G(n, f) is equivalent to a tensor representation.” 

Proor: Our procedure is to first determine the number of irreducible modular 
representations of @ and then to prove the existence of that many distinct irre- 
ducible tensor representations. 

Brauer and Nesbitt [1. P. 562] have shown that the number of irreducible 
modular representations of any group is equal to the number of classes of con- 
jugate elements with order prime to p. Now a f-matrix A is of order prime to 
p if and only if it is similar to a diagonal matrix (perhaps in some extension 
of f); i.e. if and only if the minimum equation of A has no repeated roots. Now 
for any f-polynomial of degree n with leading coefficient unity and constant term 
different from zero.there is just one class of similar matrices A with this polyno- 
mial as characteristic equation and minimum equation without repeated roots. 
The number of such polynomials is g”*(q — 1) where q is the number of elements 
in f. The above argument thus shows that there are exactly q" ‘(q — 1) irre- 
ducible modular representations of G. 


* See footnote 1. 
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In the second part of the proof we employ the notation and constructions of 
the preceding section. We proved there that every Schur function {)} is the 
character of some modular representation of ©. If the eigenvalues of A are 
, then {(A)} is a symmetric function of w*!, w%, ---, w™, 
Now every elementary symmetric function of n variables can be expressed as a 
linear combination of the {} with integer coefficients, and conversely each is 
an integral linear combination of power products of the elementary symmetric 
functions [3. Chapter VI]. If we consider the values of all the functions {)} 
only for arguments of the form w*?, w™, ---, w™ that come from an A in @ 
there will be certain linear relations, with rational integer coefficients, connecting 
them. Let 


(56) a(a){a(A)} = 0 


be any such linear relation holding for all A in G. We lose no generality in 
supposing that the a°(\) are rational integers not all divisible by p. If we ex- 
press the {\} in terms of the elementary symmetric functions /;,--- , E, of 


| 


wt, +--+ , w*" we get from (56) a relation of the form 


(57) Ei: -Him = 


where again the c’(j) are rational integers not all divisible by p. All the terms 
in (57) are integers in f° and so by taking residue classes modulo that prime ideal 
which maps the integers of f into f we get 


where the c(j) are elements of the prime field GF(p) in f, not all zero; and the 
e;’s are the elementary symmetric functions of --- , 

Hence the number of (integrally) linearly independent characters {\(A)} is 
at least as great as the number of monomials in ¢, --- , é, that are linearly 
independent over the GF(p), where e,, --- , é, range over f subject only to the 
restriction e, ~ 0. Now there are g”*(q — 1) monomials of degree less than q 
in @,°*:, @n1, and of degree less than g — 1 in e, , and these monomials are 
even independent over f. (For 7 < n we must allow all exponents less than q 
for e; , but any exponents exceeding g — 1 can be reduced using the identity ef = 
e;. For i = n the condition e, ~ 0 enables us to reduce all exponents below 
q — 1 by application of the identity es = 1. The independence of the result- 
ing monomials is covered by the discussions in Part I §3, see also §14 below.) 

Now the number of distinct irreducible tensor representations is certainly at 
least as great as the number of independent characters in any set of tensor 
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tations, in particular it must be at least as great as the number q”” 
q- 1) of independent {A}. But we saw above that the total number of modu- 
iy irreducible representations was q” '(q — 1) and so the theorem is proved. 


§12. General program for calculation of modular tensor characters 


Consider an indecomposable (direct) constituent U of the tensor representation 
4 1,(A) of G@(£ now a field of characteristic p with at least m elements in it, 
and fa p-adic closed field which induces f). The space for Ul has the form e&%,, 
where ¢ is a primitive idempotent of the f-group ring R of S,. Let e° be an 
idempotent counterimage of ¢ in §° [5. P. 368], In general e¢’ is not primitive, 
so suppose = + + --- where is associated with the partition (A‘) of m. 
(Go far as we now know several of the e{ may be associated with the same parti- 
tion.) Now e°%», is space for a representation U° which induces U in the same 
sense that °() induces (A). The irreducible constituents of U will therefore 
be the same as those of the representation 


$0’) 
Y’ = . . Hence the character of Ul 


will be + +--+. Thus the characters of the indecomposable con- 
stituents of the tensor representation 7, are known as soon as the decomposition 
numbers of S,, are known [1. P. 558; 7. Pp. 657-8]. At present those are known 
only form < 2p. 

To calculate the characters of the irreducible constituents of II,, requires 
knowledge of the irreducible constituents of each indecomposable direct con- 
stituent U, and in general probably further information as well. 


§13. Modular tensor characters for m < 2p 


For m < p R(= Rim, f)) is a semi-simple algebra and so each counterimage 
idempotent e° of a primitive idempotent e is again primitive, and each indecom- 
posable constituent U1 of II,, is irreducible. Hence the (A) are the irreducible 
(and indecomposable) constituents of zm ; and thus all characters are known. 

For p S m < 2p the nature of the U’s is given in Part I (Formula (50) and 
Theorem VII). If (A) is a partition without a hook of length p [7. Pp. 665-6; 
5. Part I], then (in the notation of Theorem VII) [8. P. 683] G(A) = H(A) is an 
irreducible, indecomposable direct constituent of II, , and so its character is the 
Schur function {X}. 

The remaining partitions of m are grouped together in sets of p according to 
the partitions (u) of m — p obtained by removing the hook of length p, those 
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belonging to a given (u) being labelled \’(u), --- , \”(u) according to the height 
[7. P. 665] of the hook. Then from formula (50) we get ) 


(59) = + + xe4w, t= 


and from the decomposition numbers for S», : 
(60) = + OTD), t= 


(We recall the conventions of Part I for discarding non-existent parts which ; 


appear formally [8. P. 683].) 

By Lemma IX and the argument preceding it we know that the irreducible 
constituents of the (A) are irreducible constituents of II, . Equations (59) 
and (60) show that $(A*()) can have as irreducible constituents only G’s belong- 
ing to (u). Thus we can write 

im=p—l 
(61) } » t=1,---,pD, 
7=0 
for non-negative integers a;; (= aij;(u)). Substituting (61) in (60) and equating 
coefficients of the @’s in the two expressions (59) and (60) we get 


lifj=i-1 
2if7 =7 

0 otherwise 


- Hence a;; = 0 unless j = i orj = 7 — 1, and then it follows that a1 = ai = 1, 


in other words we have 


(62) {r‘(u)} = + XGi(u) i= 1, 
and the other way around 


These last two formulae can be used to compute the characters of Kronecker 
products of tensor representations, provided the total rank never reaches 2p. 
The following theorem summarizes the above results. 

THEOREM IX. Suppose m < 2p and that f has at least m elements. If (A) is a 
partition of m without any hook of length p, then (A) is an irreducible direct con- 
stituent of Im. If (u) is a partition of l = m — p then $(A‘(u)) has the irreducible 
constituents G;1(u) and G;(u), each with multiplicity one. 


§14. Existence of a faithful tensor representation of the group ring of the full 
linear group 


Let f be a field of characteristic p and of finite order g. We are interested in 
the f-group ring ! = T (n, f) of the full linear group G(n, f). The f-enveloping 
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; algebra of any representation of & is of course a representation of !. In particu- vet 

eight lyr, the f-enveloping algebra %, of I, affords a representation of I’. | a 

X. Let = n(q — 1). Then is isomorphic to the direct 
sum + of tts representations A, . 

PROOF: Ay Part I (§§3, 4) we saw that the order of Y,, is the number of linearly 
independent forms in N = n variables yi, (whose domain is f) no non-trivial 
linear combination of which is annihilated upon multiplication by D = det || yix ||. 

hich Similarly the order, ¢, of the direct sum % + --- + %, is the number of linearly 

, independent polynomials of degree c or less in the yj, no non-trivial linear com- | 
ible bination of which is annihilated by D. To prove our theorem it is sufficient to 
59) show that ¢ equals the order of I; for if the homomorphic image of an algebra 
| has order equal to that of the algebra then the homomorphism is an isomorphism. 
= The set of all f-polynomials in N variables y; whose domain is f is a commuta- 
tive falgebra S$. By means of the identity y* = y which holds for all elements 
of f we can replace any polynomial P in the y; by a polynomial P* in which no 
exponent exceeds q — 1. If P already has all exponents less than g we call it a ‘oy 
reduced polynomial and in general call P* the reduced form of P. Thus we see ah 
that the order of $ cannot exceed q”, wait is the number of reduced monomials N 
in N variables. 

To show that the order of % is schélaaty equal to q’, we exhibit the q” poly- 
nomials 


ing 


one for each N-vector (a) = (a;, --+ , aw) over f. Unless y; = a;,7 = 1, i 
i, , N, we have e(a) = 1. The | 
e(a) are clearly linearly independent, and so fein a ‘banid for $. It is easy to see 
that the e(a) are orthogonal idempotents, and this shows that $ is a semi-simple 
algebra, being the direct sum of q” copies of f. 
If P is any element of & then the set of all Q in $ which annihilate P form a 
two-sided ideal denoted by $(D). The element P’ = 1 — P*” is a generating 
' idempotent for this ideal. To prove this we note first that P’ is an idempotent 
amnihilator of P and second that if QP = 0 then QP’ = Q — QP" = Q. 
Returning now to our case of N = n’ variables yi , we are interested in the 
annihilating ideal $(D). It will have f-order s where s is the number of the 
idempotents e(ax%) which belong to $(D). Now De(aix) has value det || ai || 
if yx = aj, (for all i and k) and value zero for all other choices of yx. Hence, s 
equals the number of Fosule n-rowed f-matrices, and g" — s will thus be the 


number -(q" — q" of non-singular n-rowed f-matrices. 
That is, — s is the order of G(n, f) and therefore of (n, f). 
Now to complete the proof of our theorem we observe that ¢ = n'(q — 1) 


is the maximum degree of any reduced monomial in n’ variables at 80 the num- 
ber, ¢, of linearly independent polynomials of degree c or less in n’ variables, 
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no linear combination of which lies in $(D) is just q” — 8, i.e. equals the order | 


of 

Theorem X of course implies Theorem VIII, but we keep the independent 
proof of that theorem because of other information which it gives about the 
irreducible representations. 


§15. More about $(D), especially for n = 2 


The proof of Theorem X points the way to the study of each %,. We shall 
call a polynomial P in the ys, a semiform if P = Q* where Qisaform. Then the 
order of %,, is the number N(n’, g, m) of linearly independent semiforms of de- 
grees m,m — q + 1, m — 2(q — 1), --- minus the number D(n’, q, m) of linearly 
independent forms of these same degrees which lie in the ideal $(D). (In the 
sequence of degrees m — i(q — 1) we stop with the term for which 1 < m — 
i(q — 1) <q.) In the following section we compute N(n’, g, m) for all m < 29; 
and in the present section we study D(n’, q, m) for m < 2q with especial attention 
(including additional material) for the case n = 2. 

One important question, as yet unsolved for the general n is the lowest degree 
m = M for which D(n’, q¢,M) ¥ 0. The relation yy(D** — 1)D = 0 shows that 
M Ss n(q — 1) +1. It can be shown (by a rather direct but clumsy and un- 
interesting argument which we omit) that forn > 2, M > 2q—1. Forn=2 
we shall now show that M = 2g — 1, and shall moreover give a complete deter- 
mination of the structure of $¥(D) for this case. 

We recall that the order of the ideal $(D) is the number s of singular f-ma- 
trices; which is, for n = 2, q° + q’ — q. It is clear that we can select a f-basis 
for $(D) from products of its idempotent generator D’ = 1 — D*” by monomials. 
We now show that there are only s distinct products of D’ by monomials; it will 
then follow that these are a f’-basis for $(D). 

We associate with the monomial M(a) = Ty the matrix (a) = || a(ik) || 
of exponents, the a(zk) being rational integers less than q. Py repeated applica- 
tions of the identity 


(64) Yny2D! = yrynD’ 


(obtained from DD’ = 0) we see that each product M(a)D’ is equal to some 
product M(8)D’ where B(12) 6(21) = 0; and so henceforth we consider only 
monomials M(a) with a(12) a(21) = 0. Next, if a(21) = 0 and a(12) + min 
(a(11), a(22)) = q we can use the identity (64) to replace M(a) by a monomial 
M(6) of lower degree and still have M(a)D’ = M(g)D’. This new (8) might 
have (12) B(21) ¥ 0; if so we can again apply (64) to get M(y) of the same 
degree as M(), with y(12) (21) = 0, and with M(y)D’ = M(a)D’. A similar 
argument applies if a(12) = 0. So we are justified in henceforth restricting 
ourselves to monomial multiplers M(a) with 


(65) a(21)=0; a(12) + a(21) + min (a(11), a(22)) < g. 
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There are still further eliminations possible, but for these we must study the 
form of D’ somewhat closer. Since D** = D*/D we have 


D™ = yfyf’ + yaya + 
from this we see that if a(11) o(22) = 0, the first two terms of M(a)D’ cancel 
leaving the g — 1 terms 


1 


where exponents exceeding g — 1 are to be reduced by g — 1. The terms in 
this sum form a cycle, each term being obtained from the preceding by subtract- 
ing 1 from the exponents of yu, yo: and adding 1 to the exponents of yx, yn, 
except that when subtraction would leave a zero exponent we put in g — 1 and 
when addition would give g we put in 1. (In particular this rule applies from 
the last term to the first, hence the word “cycle.”) Thus the whole cycle is 
determined by any one of its terms, and two such cycles will either be identical 
or have no term in common. We can therefore characterize the whole sum 
M(a)D' by its term involving y{;’, i.e. the one with vy = a(11): 
(66) yi oom, 
where some of the exponents may have to be reduced by g — 1. 

It is clear that no two distinct monomials both with a(12) = 0 or both with 
a(21) = 0 can give rise to cyclic products having the same term (66). However, 
if a(12) = 0, 6(21) = 0, and 


a(22) — a(11) = B(22) — B(11), a(11) = B(11) + B(22), 
a(11) + a(21) = B(11) mod (q — 1) 


then M(«)D’ = M(g)D’. The congruences (67) set up a one to one correspond- 
ence between all monomials M(a) with a(11) a(22) a(21) ¥ 0, a(12) = 0, and 
all monomials M(B) with 6(11) B(22) 8(12) 0, 8(21) = 0. For our calculation 
of the number of distinct products M(a)D’ we need to know for how many 
corresponding pairs (a) and (g) defined by (67) do both (a) and (8) satisfy the 
second condition of (65)? For the sake of definiteness in choosing a basic set of 
monomials we will agree to keep the one in such a pair which has 6(21) = 0, and 
discard the other. Solving for (8) in (67) we get 


B(11) = e(11) + a(22) mod (q— 1), = a(22) + a(21) mod (g — 1), 
B(12) = g— 1 — a(21). 


First case: If both a(11) + a(21) < q and a(22) + a(21) < gq then the con- 
gruences in (68) can be replaced by equalities. Hence 8(11) + A(12) = q — 1 
+ a(11)-and B(22) + g(12) = q — 1 + a(22) are both greater than g — 1 so 
that (8) does not satisfy (65); by our agreement, all such («) must be kept. 


(67) 
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Second case: If a(11) + a(21) = gq then B(11) = a(11) + a(21) — ¢+ 1 and 
so (8) satisfies (65). Similarly if a(22) + a(21) 2 q then (8) satisfies (65). In 
both instances (a) must be rejected. 

The cases discussed exhaust all possibilities; so we are now ready to collect 
the above results and count the remaining monomials. In the foilowing ma- 
trices no a(ik) which appears is permitted to take the value zero: 


a(ll) 
(v) a(21) «a(22) 


There are 1, 4(q rs, 1), 4A(q tig 1)’, (q ba 1)’ matrices of types (i), (ii), (iii), (iv) 
respectively. To calculate the number, Ni, of type (v) we observe that if 
a(21) = j, then there are (q — 1)* — 7’ pairs of values for a(11) and a(22) for 
which a(21) + min (a(11), a(22)) < q and so 


Similarly the number, N2 , of type (vi) turns out to be 


Hence + = (q — — (¢— 1)”. Adding all the above numbers together 
we get g + — q = 8, which was our goal. 

Therefore, the above set of s matrices || a(ik) || give s products M(a)D’ which 
are a f-basis for $(D). If || a(zk) || is one of the above matrices we shall refer to 
M(a) as a selected monomial. Note that no selected monomial has degree ex- 
ceeding 2g — 2. 

This completes the general study of the ideal S(D), but for application to the 
study of %,-1 a little further analysis is needed. 

Lemma X. There are exactly 4q linearly independent semiforms of degrce 
2q — 1 in the ideal 3(D). 

Proor: Any semiform of degree 2q — 1 is a sum of monomials of degree 1, 4, 
and 2g — 1. There are 4 selected monomials of degree 1, and 4(q — 1) selected 
monomials of degree q and of type (iii); for each of these monomials the product 
with D’ has degree 2g — 1. The remaining selected monomials of degree q have 


a(21) + min (a(11), a(22)) < 


, «(12) + max (a(11), a(22)) < g. 
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(11) (22) # 0 and so their products with D’ have the cyclic character de- 
yribed above: no two having a term in common, and all of degree 3q — 2. 
Hence no f-combination of such products can have degree 2g — 1. Since there 
gre no selected monomials of degree 2q — 1 our proof is now complete. 
§16. The calculation of N(n’, g, m) for g < m < 2q 

The number N(N, g, m) of linearly independent semiforms of degrees m, 
m-q+ 1, --+ in N variables y; is clearly equal to the total number of reduced 
monomials of these degrees. For a monomial M of degree m < 2q the reduced 
monomial M can differ from M in at most one variable. In the following argu- 
ment we first assume g S m < 2¢ — 1 and then supply a modified argument for 
the remaining case m = 2g — 1. 

If M* # M, then M* has degree 1 = m — q + 1, so N(N, gn) = number 
of monomials of degree I plus the number of reduced monomials of degree m. 
let M* be a monomial of degree / involving exactly 8» variables; 6; to the first 


degree, B to the second degree, --- , B; to the i degree, ---. Then 8) = 6; + 
hes + 6, and 1 = B, + 26. + --- + 16. M* is the reduced monomial of 


exactly 8) monomials M of degree m, namely, those obtained by adding g — 1 
to the exponent of each variable in turn. There are 


Bo 1 Be Bi B,! Be! eee B,! 
different monomials of degree | having the same exponents (aside from arrange- 


ment) as M*, Every non-reduced M can be obtained in this way for some 
partition (8) of 1. Hence, if we set 


=D 


the sum being over all partitions (8) of 1, we have that N(N, g, m) + RN, q, l) 


=(" sgink ') = total number of monomials of degree m in N variables. We 


now calculate R(N, q, 1). 
This lemma is the case 7 = 0 of the following formula: 
-j-2 Bo! 
71 


This formula is readily verified for small N; for any N and j = | — 1 it becomes 
the identity 
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since there are no terms on the right with 8) < j and there are only two partitions 
of l with B®: 2 7 =1— 1. Now we proceed by induction; i.e. we consider a, par- 
ticular N and j and take as induction hypothesis that (71) is valid for all smaller 
N, and for the given N with all larger j7. Apply the general binomial relation 


= ') + to the left side of (71) and we get two terms which 


are the left sides of (71) for — 1andj, and for N'andj +1: Theeame happens | 


on the right hand side, and this completes the induction argument. 

The above computations were based on the fact that for M # M* the differ- 
ence in degrees was g — 1. For m = 2q — 1 the difference in degrees can be 
either g — 1 or 2q — 2, but the latter arises only for the N monomials yj" and 
since they give rise to N distinct reduced monomials, we can apply Lemma XI 
for 1 = q to obtain the number of independent “relations” between monomials 
of degree 2g — 1. 


§17. Preparations for the reduction of II,, for p S m < 2p — 1, andq = p 


The number R(n’, p, m — p + 1) calculated in the previous section can be 
thought of as the loss in the order of &,, due to replacing a f with more than p 
elements by the f with p elements. We now show how to account for this loss 
of order in terms of the representations. . 

Our procedure is first to present II,, in the reduced form given by Theorem 
VII (which is, of course, valid for all f), and then to consider what further re- 
ductions are possible when f has just p elements. We will need to study the 
case of general f in more detail than is given by Theorem VII; to minimize the 
shuttling back and forth from one f to another we will first do all the work for 
the larger f and then shift to the smaller one. 

Since a — a” is a homomorphism (automorphism if f is finite) of f into itself 
the mapping || ai || — || a% || is a representation of the full linear group G. 

For the rest of this section we suppose that f has g > p elements. Then the 
character of the above representation of © is readily computed as s, = sum of 
the p™ powers of the complex number counterimages of the modular eigenvalues 
of || ax ||. Now we have [3. P. 86] s, = {p} — {p — 1,1} + --- + {17}. For 
m = p, we call (0): 0 = 0 a partition of zero, and so in the notation of Theorem 
VII we have (p — i, 1°) = \**"(0). Then from formula (63) we get 


(72) = — + {NO} = 


Applying the Brauer-Nesbitt criterion that two irreducible representations are 
equivalent if their characters are equal we see that (with proper choice of basis 
elements) if A = ||-aj || then G@o(OXA) = || a% |]. 

Lemma XII. If f has more than p elements and 1 < p, the enveloping algebra 


© of Go(0) X Ths has order h equal to the product of the orders of the factors, 1.. 
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Furthermore, the Kronecker product of Go(0) by any irreducible constituent @(u) of 
[1.1 is the irreducible constituent Go(u) of S and H is the direct sum of such con- 
stituents. 

Proor: The elements of II;_;,are monomials of degree 1 — 1 in the elements 
a, of A. Hence, if the order of the Kronecker product were less than h there 
yould have to be a non-zero form P(bi , ax), each term linear in the variables 
b, and of degree 1 — 1 in the variables a;, , such that P(ai,, au) = 0 for all 
values of the ax. Sincem = p+1—1 <p’ S gq this means that P(A} , Ax) 
must vanish identically as a form in n° indeterminants Ay. But this cannot 
happen, since Ai,.M = Aj,»M’ with M and M’ monomials of degree 1 — 1 < p 
requires 1 = 7’, k = k’, and therefore M = M’. In other words P(Ai, , Aix) has 
exactly the same coefficients as the initial P(bi., au). This establishes the 
value of h. 

The same argument can be used to establish the irreducibility of G(x) X Go(0) 
provided we take for M and M’ forms of degree 1 — 1. The equality G(u) X 
G,(0) = Go(u) follows from the character formula (63), and the relation 


This last relation is established by a short computation based on a theorem of 
littlewood [3. P. 70, Th. IT]. 


§18. The reduced form of II,, for p S$ m < 2p — 1,andq = p 


In this section we take f with exactly p elements. Our procedure is first to 
present II,, in the reduced form given by Theorem VII (which is, of course, 
valid for all f of characteristic p), and then to consider what further reductions 
can be made due to the shrinking of the size of f. 

Since a” = a for all a in we have that @(OXA) = || a% || = || aun || = A; ie. 
G0) = TI, . Hence, @(0) = and if 1 < p this is fully reducible of 
order h’ = (" rain hy '). We regard II,, as being the product II, X I. and 
thus see that each irreducible constituent of the S of Lemma XII must now de- 
compose into a direct sum of irreducible constituents of I,. The irreducible 
constituents of © were irreducible constituents of II, and might appear in the 
product I, X Ij; in various places, but wherever they appeared they must now 
decompose. The actual form of this decomposition can be determined by 
studying the characters, making use of the relation sp = s, which holds for our 
present f. (We shall make frequent use of such computations below in the treat- 
ment of the case m = 2,m = 2p —1,q =p.) The difference h — h’ is the loss 
in order of %» due to the relation G(0) = Ih. Buth — h’ = R(n’, p, 1) which 
we have already seen to be the total loss in order of Y,,. This shows that the 
only modifications needed to bring Am in fully reduced form for our case are 
those described above on the irreducible constituents that appear in $. (For, 
because of the independence of the various matrix sets appearing in the reduced 
form of II, in Th. VII any further reductions would entail further loss in order.) 
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since there are no terms on the right with 8) < j and there are only two partitions 
of with 2 7 1. Now we proceed by induction; i.e. we consider a par- 
ticular N and j and take as induction hypothesis that (71) is valid for all smaller 
N, and for the given N with all larger j7. Apply the general binomial relation 


= (" ') + to the left side of (71) and we get two terms which 


are the left sides of (71) for N — landj, and for N andj +1. The same happens 
on the right hand side, and this completes the induction argument. 

The above computations were based on the fact that for M # M* the differ- 
ence in degrees was q — 1. For m = 2q — 1 the difference in degrees can be 
either g — 1 or 2q — 2, but the latter arises only for the N monomials y{*" and 
since they give rise to N distinct reduced monomials, we can apply Lemma XI 
for | = q to obtain the number of independent “‘relations” between monomials 


of degree 2g — 1. 
§17. Preparations for the reduction of II,, for p S m < 2p — 1, andq = p 


The number R(n, p, m — p + 1) calculated in the previous section can be 
thought of as the loss in the order of Y,, due to replacing a f with more than p 
elements by the f with p elements. We now show how to account for this loss 
of order in terms of the representations. . 

Our procedure is first to present II,, in the reduced form given by Theorem 
VII (which is, of course, valid for all f), and then to consider what further re- 
ductions are possible when f has just p elements. We will need to study the 
case of general f in more detail than is given by Theorem VII; to minimize the 
shuttling back and forth from one f to another we will first do all the work for 
the larger f and then shift to the smaller one. 

Since a — a” is a homomorphism (automorphism if f is finite) of f into itself 
the mapping || ai || — || a2. || is a representation of the full linear group G. 

For the rest of this section we suppose that f has g > p elements. Then the 
character of the above representation of © is readily computed as s, = sum of 
the p™ powers of the complex number counterimages of the modular eigenvalues 
of || az ||. Now we have [3. P. 86] s, = {p} — {p — 1,1} + --- + {17}. For 
m = p, we call (0): 0 = 0 a partition of zero, and so in the notation of Theorem 
VII we have (p — i, 1°) = 4*"(0). Then from formula (63) we get 


(72) xen = {'(0)} — + + = 


Applying the Brauer-Nesbitt criterion that two irreducible representations are 
equivalent if their characters are equal we see that (with proper choice of basis 
elements) if A = || then Go(OXA) = || a% |]. 

Lemma XII. If f has more than p elements and 1 < p, the enveloping algebra 
© of Go(0) X Ths has order h equal to the product of the orders of the factors, i.¢. 
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Furthermore, the Kronecker product of Go(0) by any irreducible constituent @(u) of 
[1.1 8 the irreducible constituent Go(u) of and H is the direct sum of such con- 
stituents. 

Proor: The elements of I;_;,are monomials of degree 1 — 1 in the elements 
a; of A. Hence, if the order of the Kronecker product were less than h there 
vould have to be a non-zero form P(bs% , ai), each term linear in the variables 
ba and of degree 1 — 1 in the variables a;, , such that P(ah,, ax) = 0 for all 
values of the a,. Sincem = p+1—1 <p’ < q this means that P(A%, , Aw) 
must vanish identically as a form in n’ indeterminants Ay. But this cannot 
happen, since Af,M = Ajx-M’ with M and M’ monomials of degree 1 — 1 < p 
requires i = 7’, k = k’, and therefore M = M’. In other words P(Ai, , Aix) has 
exactly the same coefficients as the initial P(byx, au). This establishes the 
value of h. 

The same argument can be used to establish the irreducibility of G(u) X Go(0) 
provided we take for M and M’ forms of degree 1 — 1. The equality G(u) xX 
6)(0) = Go(u) follows from the character formula (63), and the relation 


{u} = — + + 


This last relation is established by a short computation based on a theorem of 
littlewood (3. P. 70, Th. IT]. 


§18. The reduced form of II,, for p S$ m < 2p — l,andqg = p 


In this section we take f with exactly p elements. Our procedure is first to 
present II,, in the reduced form given by Theorem VII (which is, of course, 
valid for all £ of characteristic p), and then to consider what further reductions 
can be made due to the shrinking of the size of f. 

Since a? = a for all ain f we have that Go(OXA) = || a% || = || au || = A; ie. 
= 1,1. Hence, G(0) X = I; and if < this is fully reducible of 


2 
= (” We regard II,, as being the product I, X and 


thus see that each irreducible constituent of the S of Lemma XII must now de- 
compose into a direct sum of irreducible constituents of Il,. The irreducible 
constituents of © were irreducible constituents of II, and might appear in the 
product II, X II; in various places, but wherever they appeared they must now 
decompose. The actual form of this decomposition can be determined by 
studying the characters, making use of the relation s, = s, which holds for our 
present f. (We shall make frequent use of such computations below in the treat- 
ment of the case m = 2,m = 2p —1,q =p.) The difference h — h’ is the loss 
in order of %, due to the relation @)(0) = Ih. Buth — h’ = R(n’, p, l) which 
we have already seen to be the total loss in order of Xn. This shows that the 
only modifications needed to bring %m in fully reduced form for our case are 
those described above on the irreducible constituents that appear in $. (For, 
because of the independence of the various matrix sets appearing in the reduced 
form of I, in Th. VII any further reductions would entail further loss in order.) 
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We have now proved the main theorem: | 

TurorEM XI. [f £ is the field with p elements andp Sm=p+l—-i< 
2p — 1 then the reduced form of the Kronecker m™ power representation II, of the 
full linear group @(n, f) of non-singular n-rowed f-matrices is given by Theorem 
VII except for the following modifications: the diagonal constituent Go(u) of Up 
(irreducible for € with more than p elements) is now equal to G(u) K Th, which 
is the direct sum @(u') + G(u") + «++ of irreducible constituents of I, ; the parti- 
tions (u'), (u’), «+ being those which appear in the decomposition of the character 
product {u}{1} = {w} + {w} 


§19. The decomposition for g = p,m = 2p — 1,n > 2 


Lem. XI and XII both include the case 1 = p, and we know that the order 
of II, is the same for all f. Hence, it is still true that all the changes due to the 
smallness of f are consequences of the relation G(0) = I,. However, this time 
the constituents Go(u) = G(u) X Go(0) will not always split into a direct sum of 
irreducible constituents of II,. But, since G(u) is an indecomposable constit- 
uent of II,1, G(u) X I will be a direct sum of indecomposable constituents 
of II, , and just which will appear can be determined by the character theory. 
With these modifications Th. XI will hold form = 2p — 1,n > 2. Forn = 2 
these modifications are still valid, but no longer tell the whole story, since we 
have then (according to Lem. X) to account for a further loss in order of 4p 
due to the presence of determinant annihilators. 


§20. The decomposition for g = p,m = 2p — 1,n = 2 


For the present section we denote by © the full linear group of two rowed 
matrices for a field with g > p elements and by @* the same for a field with 
q = pelements. Since @* is a subgroup of G any representation T of @ when 
considered only for elements of @* affords a representation of G* which we denote 
by E*. In particular we apply this notation to the tensor representations and 
constituents Im, Um, U'(u), GA), Gi(u), Gi(u) defined above (cf. Th. VII). 

Since n = 2, we have {\} = 0 for any partition (A) of any integer m into more 
than two summands [10, P. 128, Lem. 4.4B]. Hence in applications of Th. VII 
for n = 2, q > p we must delete all constituents that arise from partitions of m 
into more than two parts. (Indeed such modifications are always to be made 
when » > m.) 

The following string of lemmas, taken with Th. VII, Th. XI, and the results 
of §19 complete the analysis for n = 2, m = 2p — 1, q = Pp. 

Lemma XIII. Every partition (A): m = dy + de for m = 2p — 1 and m = 
2p — 3 has a hook of length p; indeed, if 1 — 2 = p then (A+) = (1 — P, »») 
and if — < p then (A) = — 1, — pt 1). 

We co se to stand for det A whether A is in G or in 6. We recall that 
for n = = {17} = @(1’). 

ince saleaibee — 3 = wu + m is any one or two element partition of 
p — 3, and we denote by (u)p the partition p — 1 = (us + 1) + (ue + 1) then 
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aiy)}-D = {\‘(u)o}, 4 = 1, 2. Furthermore, \\(p — 1) is the only one or two 
dement partition of 2p — 1 which does not have the form d*(u) p. 
Lemma XV. As (u) runs through all one or two element partitions of p — 3 


Go(u) Go(u) » Wi) = Gu) 
Gifu) Gi(u) 


give the indecomposable constituents of Ile». The U'(u)* are the indecomposable 
smstituents of The decomposed form of W'(u)* differs from that of U'(u) 
oly in that Go(u)* = G(u)* X Ih* ts a direct sum of (at most two) irreducible 
representations of @*. In particular, for each (u) the enveloping algebras of U'(u) 
and U'(u)* have radicals of equal order. 

Lemma XVI. For every one or two element partition (u) of p — 3 we have (i) 
= U'(u) D; Gi) = U'(u)* X D is an indecomposable constituent 
of and (iii) X D has the same order as Ury3*. 

@(1)* 
@i(0)* 6,(0)*|| 


Lemma XVIII. Let O(%n*) denote the order of Um*. Then O(Yy1*) — 
0(%»-s*) = (p — 
Lemma XIX. In U'(p — 2, 1)* we have 
i) Gi(p — 2, 1)* = G(1)*; 
* 
i) — 2, 1)* = G(»v + 1, 


||. 


W(x) = 


Lemma XVII. W(p — 1)* = U'(0)* = 


where 2v = p — 1; 


(ii) — 2, 1)* = 


(iv) Gi(p — 2, 1)* = || 


Here, the refinement into submatrices in (iii) and (iv) is coherent with that in (ii) 
and the single ““*’ denotes a submatrix in whose form we are not now concerned. 

To prove Lemma XIII we need only to observe that \; 2 >» and Ai + Ax ]> 
2p — 3 imply that 4. = p — 1. 

Proor or Lemma XIV. For n = 2 we have {d1, Ax} {1} = (+ 1) 
according to the product rule for Schur functions [3. P. 94, Th. V]. But if 
(1, Xe) = A‘(u) then (Ay + 1, 2 + 1) = A‘(u)d. For the second part of the 
lemma we note that for any partition (\’): 2p — 1 = M +2 With es # 0 we 
have (\’) = d*(u)p where (u) and ¢ are defined by (Ai — 1, 42 — 1) = A‘(u). 

Lemma XV is a restatement, for the present case, of Th. X, taking account 
of the partition structure for m = 2p — 3 given in Lemma XIII. 

Proor or Lemma XVI. Since D is a one rowed representation, multiplication 
of any representation by D cannot change its structure provided no annihilators 
of D are encountered. But no annihilator of D has degree less than 2p — 2, 
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and so %2p-3* X D has the same order as %2,;*. Furthermore, U‘(x) xX D and 
U‘(u)* X D will be indecomposable and will have enveloping algebras ‘isomorphic 
to those of and U'(u)*, respectively. 

Now since IIzy-1 = Izp-s X Iz and D is an indecomposable constituent of Il, , 
U'(u) X D must be an indecomposable constituent of Iz): ; then from Lemma 
XIV and the character formulae of §13 it follows that U‘(u) X D = U'(u)p. 
Now put stars on both sides of this equation and we get the remaining assertion 
of the present lemma. 

Proor or Lemma XVII. Since \*(p — 1) is a three element partition, 
U'(p — 1) is irreducible and equal to Go(p — 1), which is in turn equal 
to G(p — 1) X G(0), (ef. Lem. XII). Hence, U'(p — 1)* = G(p — 1)* x 
Go(0)* = G(p — 1)* K G(1)*. Now by the argument made in §19, the character 
relation {p — 1}{1} = {p} + {p — 1, 1}, and the formulae of §13 
we get U'(p — 1)* = U'(0)*, of which the detailed form is given by Th. VII. 

Proor or LemMa XVIII. In §15 we showed that 0(%2p-s*) = N(2’, p, 2p — 3) 
and 0(%s)-1*) = N(2°, p, 2p — 1) — 4p. The value of N(N, q, m) for m < 2% 
is given by Lem. XI. Computing with these facts we arrive at the equality 
stated in the present lemma. 

Proor oF Lemma XIX. According to Lem. XV we have G;(p — 2, 1)* 
W(p — 2, 1)* which has the character {\°(p — 2, 1)} = {p, p — 1} = {1}D”* 
{1}. This establishes part (i). : 

- Part (ii) follows from application of Lem. XV and XVI for (u) = (p — 3). 

According to Lem. XIV, XVI, and XVII, U'(p — 1)* is the only indecompos- 
able constituent of I,3:* which does not appear in IIz,_;* X D. Hence, by Lem. 
XVI and XVIII the order of the enveloping algebra of 


— 1)* 


mest x DI 


is just (p — 1)’ more than the order of I,;* X D. Reference to Lem. XVI 
and study of the characters of irreducible representations appearing there 
shows that G,(0)* does not appear twice in any indecomposable constituent of 
TIep-3* X D. Hence, the radical constituent ©}(0)* of U'(p — 1)* cannot appear 
in IIep-s* X D. Now by Th. VII and XI the degree of G,(0)* = G(p — 1, 1)* 
is p — 1; and so @i(0)* contains (p — 1)’ independent matrices. As a conse- 
quence of this, every remaining constituent of 11'(0)* must appear somewhere 
in IIzp-3* X D. We have seen in Parts (i) and (ii) of the present lemma that the 
irreducible constituents of U'(0)* occur also in 1(p — 2, 1)*; and a study of the 
characters shows that G,(0)* occurs nowhere else in Iz,:.*. Now @(0)* can 
only be equal to some constituent appearing (in Ilz,-3* X D) below G,(0)* and 
to the left of G(1)*; this forces Part (iii) of our lemma. The same considerations 
relative to. @i(0)* gives Part (iv). This completes the proof of Lemma XIX. 
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CONSTRUCTION OF CENTRAL SIMPLE ASSOCIATIVE ALGEBRAS 


By N. JACOBSON 
(Received December 20, 1943) 


It is well known that any central division algebra 2 over & contains a subfield 
P of dimensionality n if n’ is the dimensionality of % over. Moreover the field 
P may be chosen to be separable over @ and there exist algebras of the form 
8 = A xX &, that contain normal and separable subfields of dimensionality nr 
over ®. The algebra % is a normal crossed product and the theory of these 
algebras has proved to be of great importance in the study of central division 
algebras. Nevertheless the passage from %& to B is somewhat artificial. A 
more direct procedure would be to study % relative to any one of its maximal 
subfields P or, at any rate, relative to a maximal separable subfield. This is 
true especially since it is not known whether or not 2% possesses a normal separable 
subfield of dimensionality n. The beginning of the study of an algebra relative 
to an arbitrary subfield was made by Professor Wedderburn in a beautiful paper 
appearing in 1921.’ Here Wedderburn showed that % can be regarded as a 
vector space over P and that one obtains in this way a representation of P by 
matrices with elements in P. 

In a recent paper’ the present author has considered the general theory of 
self-representations of a field. As we show here, this theory can be applied to 
investigate the structure of a central simple algebra of dimensionality n’ relative 
to any separable subfield of dimensionality n. We shall show that the self- 
representations of % defined by P are the regular self-representations and that 
% may be written as a type of crossed product (P, Z, c) where Z denotes a regular 
self-representation and a is a factor set. We obtain associativity conditions, 
conditions that % be simple and that % be the complete matrix algebra 4, . 
As an application of the theorem that the automorphisms of ©, are all inner we 
obtain necessary and sufficient conditions that an element of P, have the form 
aa, a in P and a — a” a regular self-representation. 


1. Self-representations of fields 


In this section we collect some of the results of the theory of self-representa- 
tions that we require in the sequel.’ We recall first the definition of a double 
P-module Y, P a field, to be a left and a right P-module satisfying the following 
conditions: 


1. lr 21 
2. (ax)8 = a(x8) 
3. (A: Pr) =n < 


! On division algebras, Trans. Am. Math. Soc. 22 (1921), 129-135. 

* An extension of Galois theory to non-normal and non-separable fields, Am. Jour. of Math. 
vol. 66 (1944), pp. 1-29 referred to hereafter as E. 

’ Cf. E for the results stated in this section. 
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Here Pp denotes the set of right transformations x — xa. If we choose a right 
basis » for we may write ax; = Then the correspond- 
ence a > a” = (a);) is a self-representation of P, i.e. a representation of P 
by matrices with elements in P. Conversely any self-representation may be 
obtained in this way. A change of basis from 2, --- , 2, to y1, --- , Yn Where 
yi = > zai replaces the self-representation E by the self-representation F 
where a” = M"'a*M, M = (ui;). We call F and E£ similar. 

Ifa > a” is a self-representation, then a — a” = (a”)’ the transposed of 
a’ is also a self-representation. Let ® be the subfield of P of elements 7 that 
are fixed in the sense that y” is the diagonal matrix {y, --- , y} and suppose that 
Pis finite and separable over. Then P = (6) and the self-representations Z 
and E’ are completely determined by the matrix 6”. Since 6” is similar to 
the self-representations and E’ are similar. 

We recall also the definition of a composite (K, S, 7’) of P with itself as a sys- 
tem consisting of a commutative ring K and two isomorphisms S and 7’ between 
Pand subfields P* and P” of K such that 


1) 1°=17, 2) K=P*P’, 3) (K:P”) is finite. 


If Y is the module associated with the self-representation FZ, then %{ determines 
a composite (P,P, L, R) where L is the correspondence between a and the 
left multiplication « — ax and R is the correspondence between a and x — za. 
On the ‘other hand if (K, S, 7’) is a composite, K is a double P-module module 
lative to the multiplications az = = and ra = at = 2a’. 

There is also a second way in which a composite (K, S, 7) determines a 


double P-module. We choose a basis aj, --:, a. for K over Then for 
any a in P we may write 
(1) a® = aim(a)” + 


and the mappings M; : a — yu;(a) are single valued. These transformations are 
endomorphisms in the additive group of P and it is readily seen that the totality 
of endomorphisms >> Mj; where # is the multiplication a — ap in P is an in- 
variant of (K, S, 7), ie. the sets of this type determined by any pair of bases 
ai,---, a, and Bi, --- , B§ are identical. We call the set of endomorphisms 
> Mis; the relations space © of the composite. 

Suppose that the multiplication table of the aj is 


(2) ajay = ares; 
Then a simple computation shows that 
(3) pM; = >) 


Thus © is also invariant under left multiplication by the elements p. It can 
be shown that © has dimensionality n over P (on the right). Hence if we set 
oM = 5M and Mp = Mj for M in G, we may consider © as a double P-module. 
The matrix corresponding to p in the self-representation determined by the basis 
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Mi, M, of S is because of (3), p° where AG ji 
On the other hand, if we regard K as a double P-module as before and we use 
the right basis aj , --- , a, for P we have for any p in P 


= 
Thus the self-representation determined by this basis is p” = (pE;;) where 
pE;; = €;(oM)). Since P is commutative = Hence Gy = E;;. 
THEOREM 1. The self-representations determined by the relations space of a 
composite (K, S, T’) are the transposed representations of the representations deter- 
mined by K. 


2. A special generation of ®,, 


A composite of particular interest is obtained by forming the direct product K 
over ® of P by itself. Here & is a subfield of P such that (P:®) = n < © and 
K = P* x P” where y* = y’ if y e® and (K: P”) = (K: P*) = (P: 4). If 
a1, *** , Gis a basis for P over ® then aj, ---, a» is a basis for K over P’, 
Thus if a® = >> aiui(a)” then the elements u;(a) arein®. The representations 
determined by K are called regular representations and those determined by 
the bases aj, --: , a, in which aj = 1% are the special regular representations 
of P. The matrices of a special regular representation belong to ®, . 

A characteristic property of the direct product is that its relations space S 
is a ring.’ Moreover it can be shown that © is the complete set of endomorph- 
isms in P commutative with the endomorphisms 7, y in ®. It follows that 
S 

Now let a, --- , a, be a basis of P such that a, = 1. Then for any y in 
= = and hence yM,; = y and yM; = ¥ 1. Since aM;«® 
for any a it follows that, 


(4) MM, = M; MM; = O0ifj 1. 


As before for any p in P we have pM; = >> M,(pE;;) where p — p’ is the special 
regular representation determined by the basis aj, --- , a, of K. By means 
of the isomorphism © = 4,, this proves 

TuerorEeM 2. Let P be an arbitrary field over ® such that (P: 6) = n and let 
p—p = (pE;;) be a special regular representation of P over ®. Then P may be 
regarded as a subfield of ®, and there exist elements y,, +--+, Yn in &, such that 
every element in ®, may be written in one and only one way in the form D vii 
pi in P where the y; satisfy 


3. Maximal subfields of a central simple algebra 


We suppose now that % is any central simple algebra of dimensionality n 
over and that 2% contains a subfield P of dimensionality n. The latter as- 


4E, pp. 19-21. 
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smption is always satisfied when % is a division algebra. We turn & into a 
double P-module by defining ax and za to be the ordinary products of a in 
Phyzin Since (P:@) = n, Pr) = n = Ifais any element in 
we denote the left multiplication x — az by a, and the right multiplication 
1720 bya,. It is known that if a,, --- , a, are elements of %& that are inde- 
pendent over ® then > andi; = 0 only if all the b; = 0. In particular 
if, *** , @» form a basis for P over ® then > ainBir = 0 only if all the 6; = 0. 
This implies that the composite (P,P, , L, R) is equivalent to (P* x P’, S, T)’. 

Suppose now that P is separable over ®. Then the regular self-representation 
of P over ® is completely reducible into dissimilar irreducible components. 
Since the self-representation determined by 9% has a composite equivalent to the 
direct product, it is completely reducible into irreducible components that 


include all of the components of the regular representation. A comparison of | 


the ranks of these two self-representations shows that they are similar. 

TaroreM 3. Let & be a central simple algebra of dimensionality n° over ® and 
let P be a separable subfield of dimensionality n. Then the self-representations of P 
obtained by regarding A as a double P-module are similar to the regular self-repre- 
sentations of P over ®, 

Now let 21, --+ , 2, be a right basis of % over P. Then for every a in P we 
have az; = >> x;(aE ;:), where the self-representation a — a” is regular. More- 
over, if 


then the product of any two elements of % is completely determined by the 
multiplication of elements in P, by the self-representation E and by the factor 
sto = {oj}. Since = x) 


and since p(xsv;) = (pai)ax; for all p in P 


4. Crossed products 


We suppose now that P is any finite separable extension of of dimensionality 
n and let % be a double P-module whose representations are regular. 
let, +++, be a right P-basis for and for = >) 2’ = 4p; define 


(9) aa! = 


where the elements o;; are arbitrary elements in P that satisfy (7) and (8). 
it wish to show that % is an associative algebra over &. We note first that 
y (9) 


Equivalence of composites is defined as for field composites. Cf. E, p. 5. 
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(10) (x2x')p = x(z'p). 


Moreover 230;) = Do = Dd Since 
p — p is a representation this implies that 


(11) (xp)x’ = x(p2’). 
We note also that p(x;v;) = (px.)x; and using these relations we may prove that 


and 
aT = (a; En 7). 


Since by (7) 2z:(z,2,) = (xa,)x, this proves that Y is associative. If y «4, 
yx = xy and hence . 


= = (ay)2’ 


so that is an algebra over. Evidently 6) = P)(P: = n’. 

Turorem 4. Let P be a separable extension of dimensionality n over ®, a — a” 
a regular self-representation of P over ® and {oi} a set of elements in P satisfying 
(7) and (8). Then the totality of elements >> xip: of the n-dimensional space over 
P is an algebra if multiplication is defined by (9). 

We shall call 2 the crossed product of P with regular self-representation F and 
factor set o = {oi;,} and we write UY = (P, HL, oc). The conditions (7) and (8) 
will be called the associativity conditions on the factor set «. By the preceding 
section we have 

Turorem 5. Let 2% be a central simple algebra of dimensionality n? over & and 
let P be a separable subfield of % of dimensionality n. Then Y is a crossed product 
(P, E, 
~ The associativity conditions (8) take on a simpler form if FZ is a special regular 
representation. For in this case are in ® and hence = 
The equations (8) reduce to 


If we set o; = (¢,;) this is equivalent to 
(13) = 


Lemma. [If A is a matrix in P,, that commutes with all the matrices p", A is a 
linear combination of the matrices p” with coefficients in P. 

A matrix A in P, commutative with all the p” corresponds to an endomorphism 
A in the direct product K = P* x P” that commutes with all the multiplications 
x — xp* and all of the multiplications  — zp” and hence with all of the multi- 
plications « — za for arbitrary ain K. Since K is commutative, A is a multi- 
plication. It follows that A is a linear combination of the matrices p’ using 
coefficients in P. 
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Thus the elements o; that satisfy (13) are linear combinations of matrices 
p. Each o; will therefore depend on n parameters in P and hence the factor 
4 ¢ depends on n” parameters. 

We suppose again the £ is an arbitrary regular representation. Then we 


may verify that 
(14) | p(x’) = (px)z’. 


In particular if y:, --* , Yn is a second right P-basis for Wf then p(yys) = (pys)y;. 

Hence if yy; = >. and py; = >> then the 7’s and the F,; satisfy 
equations analogous to (7) and (8). If y; = > zai we know that a” = 
M'a"M, M = (ui;). We may verify also that 


(15) Tkij = pry pr) 
where (a;;) = 

TueorEM 6. If M is any non-singular matrix in P,, , then the crossed products 
(P, E, and (P, F, r) where a” = and satisfies (15) are isomorphic. 


5. Conditions for simplicity 


We consider now the conditions that & = (P, HZ, c) be central and simple. 
For this purpose we investigate the structure of the double P-module, A. By 
definition % is isomorphic to the double P-module K = P* X P” obtained from 
the direct product (K, S, 7). Hence we consider K. The submodules of K 
are the (two-sided) ideals of the ring K. Since P is separable over #, K = 
Ki ® Ke ® --- ® K, where the K; are fields. The self-representations deter- 
mined by the K;, are irreducible and dissimilar and any irreducible self-represen- 
tation of P over ® is similar to one of these. In particular one of the K;, say 
K,, is associated with the identity self-representation a + a. Any submodule 

If « + a” and a — a” are self-representations of P the product A xX FE, 
is the self-representation obtained by replacing the elements aE; j of a” by the 
matrices o#%, that are associated with these elements in EZ, .° Let %, be the 
double P-module associated with E, and 2, --- , 2m a basis such that az; = 
and similarly let %, be a double P-module with basis , 
such that ay; = >> y,(aE, 2). Then it is possible to define a double P-module 
having a right basis x; X y,; such that a(t; X = X 
Y(eL nL, »).’ This product module is independent of the choice of the bases 
in%;. It follows from this that if F; is a self-representation similar to EZ, and 
F; is similar to then F; X is similar to X Fe. 

Now let E; be an irreducible self-representation associated with K; and EF; 
one associated with K; and form E; X E;. This representation is completely 

*Cf. E. p. 12 or L. Katusnine, Sur la theorie de Galois de Corps non-galoisiennes separ- 
Rendus de 1’Acad. des Sciences, 214 (1942), 597-599. 
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reducible into irreducible representations determined by, say, Ki,,---, K;,. of 1 
ii If we replace E; and E; by F; and F;; two other self-representations associated in I 
iH with K; and K; respectively, we may associate with F; X F; the same Mm 1 

. ij set Ki,,--:,Ki,. Hence we may define the set (Ki,,---, Ki,) to be the W 
‘i product K; X K;of K;and K;. It is known that relative to this product the 1. 
i set of K; forms a hypergroup 6: the product is associative, K; acts as an identity henc 

et and for each K; there exists a K,; such that K; X K; = (Ki, ---) and anelement T 
Ky such that Ky X Ki = (Ky 2) f 
' We now apply these results to Y = (P, H, co). Then we see that A = %, @ C 
2 ®--- ® A, where Y%; is the double P-module corresponding to K;. Any al : 
submodule of % is a direct sum of certain of the %;. We now define the product \ 
to be the set of sums a,a; where a; and aje%;. By (10) and (14) 
AA; is a submodule of A. Thus %,%; is a direct sum of certain of theA;. Let righ 
%1,°** , tm be a right basis for A; and y1, --- , y, a right basis forM;. Any inde 
element of %,%; is a sum of elements of the form (>> xié:) (>> yjn;) and thismay (: 
. be written as >> avy ti; if we use (11). Thus the right module generated by the end 
elements is the whole module %;%;. We consider now the module men 
It is readily verified that the mapping (2: D> ry ii; is a homo- mor 
morphism between the P-modules %; X W%; and W;%;. Hence the irreducible is a 
components of 9%; ; are contained among the irreducible components of UA; x Y;. N 
It follows from our definition of K; X K; that if Ki; X K; = (Ki,,---, Ki,), a le 
then = © --- Wi,, < r if the K’s are suitably ordered. Since end 
K, X K; = K; = K; X Ki, UA; = 0 or A; and A;A, = O or A;. exis 
We assume now that = = for all X;. In particular Wi = 
Hence if wu is an element 0 in %,, uw” = whered #0. Thene = ud is & T 
idempotent. Since A= = WM = Ae. Hence eis the identity gro 
1 of A. Now % contains the subfield %, = 1P isomorphic to P. Since ax = Jf V 
a(lz) = = (la)xz and za = (rl)a = x(1a) the module operations in of t 
are the left and the right multiplications by the elements of 1P. If % is a two- sucl 
sided ideal, 8 is a submodule of 2% and hence % is a sum of certain of the %;. ts 
This proves V 
TuHeorEM 7. If & has an identity there are only a finite number of two-sided self. 
ideals in A. Any two-sided ideal is a sum of certain of the submodules %; . aut 
If we choose a basis uw, ---, U, of % composed of bases for the %;, a condi- are 
tion that & have an identity may be obtained on the elements y,;; such that We 
UU; = This is that the matrices and be non-singular. be 
For this implies that ~ 0 and 0. the 
We note next that the only elements of % that commute with all the elements con 
of 1P are the elements in this field. For if u is such an element then aw = wa tivi 
for all a. Hence we %, = 1P. Since the only elements of P that are fixed 
under any regular self-representation are the elements of ©, the only elements 
V 
* E, p. 25 or Katusnine loc. cit. in 6. Pro 
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of 1P that commute with ali the elements x; of a P-basis of % are the elements 
in 16. It follows that 1@ is the center of Y. 

Tazorem 8. If W has an identity, A is a central algebra. 

We assume next that for each module %; there exists an A; such %,%; contains 
Y,. Let be a two-sided ideal ~ 0 in A. Then contains one of the A; and 
hence contains It follows that 1 «Band = 

TuzorEM 9. The algebra A is simple if 1) WA; = A; = A, for all A; and 
2) for each WU; there exists an A; such that A;°A; contains A, . 

‘Conversely suppose that % is simple. Then % contains an identity. Since 
al = la, Hence = A; = WM. 

We consider now the set Mt = %W.Pe = Pe, of endomorphisms of the form 
tupy + °** + Gsiper Where a, is a left multiplication z — az, a in A and p, is a 
right multiplication by an element of P. We recall that if a, ---, a, are linearly 
independent over ® then + + Qsiper ~ O unless all the p; are 0. Thus 
M: Pz) = (A: &) = n*. On the other hand the elements of M are 
endomorphisms in %{ that commute with all the elements of Pg. Since the di- 
mensionality of %&{ over Pz is n, it follows that Dt is the complete set of endo- 
morphisms in %{ commutative with the elements of Pe. This implies that M 
is an irreducible set of endomorphisms. 

Now consider any one of the U%;. Form A%;. This is a submodule of % and 
aleft ideal. Hence &%; is a subgroup ~ 0 of & which is invariant under all the 
endomorphisms in J. Since Mt is irreducible AW; = A. It follows that there 
exists an %; such that %,%; contains %,. Similarly there exists an %; such that 
contains %,. This proves 

TaroreM 10. Jf 2 is a simple algebra, the irreducible modules U; form a hyper- 
group H under multiplication. 

We have also noted above the relation between H and the hypergroup 
of the K;, namely, there is a (1-1) correspondence K; — %; between § and H 
such that if K; X K; = (Ki, K;,) then UA; Wi,) where 

> 

We consider now the special case where P is normal. Here the irreducible 
self-representations of P are of rank 1 and hence they are identical with the 
automorphism a — a* of the Galois group © of P over . The modules %; 
are one dimensional over P and are in (1—1) correspondence with the elements S. 
We may denote the correspondent of S by As. Then the condition that % 
be simple is that %.%- = Wsr. We choose any element us ~ Oin As. Then 
the condition that % be simple is that usur = Usrus,r Where us,r ~ 0. These 
conditions for normal crossed products are well-known, as are also the associa- 
tivity conditions on the us,r that one obtains from (7). 


6. Conditions that = 4, 


We have seen that a change of basis from 71, , Zn Yn in a crossed 
product 9% replaces the self-representation EZ associated with the z’s by a similar 
self-representation a > a” = M~‘a*M and replaces the x-factor set « by a factor 
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set 7 defined by (15). If M isa matrix commutative with all the a” then F = £. 
In this case we shall say that the factor sets o and + are associates (o ~ 7). 

In order to determine whether or not a central simple % is the complete matrix 
algebra, it suffices to consider % in the form (P, EZ’, «) where E’ is the transposed 
of a special regular representation. Now we have seen in Theorem 2 that if 1 
denotes the factor set {dij} where Ai jx = then (P, 1) or (P, E’, 1) ~ 
1. The usual proof making use of the fact that an isomorphism between simple 
subalgebras of a central simple algebra 2{ may be extended to an automorphism 
in shows that conversely if (P, ~ 1 theno ~ 1. 

THEeorEM 11. A necessary and sufficient condition that a central simple algebra 

= (P, E’, co) ~ 1 is thato ~ 1. 

Suppose now that % = (P, E’, 1) and that y:, --- , yn is a P-basis such that 
pys = >, and yy; = Let 2n be a second right P-basis 
such that pz; = >. 2z,(pEij) and zz; = 6:3;. Then 2; = >> yjuj where the 
matrix M satisfies the following two conditions: 


(16) a” M = Ma*’ foralla, 


For (17) is obtained from (15) by setting = Tore = and replacing 
E,, by E,,. From (17) we obtain 


and conversely (18) implies (17). Now let M be any non-singular matrix in P, 
satisfying (16) and (18). Then the elements z; = >> yjuj: will satisfy the 
same multiplication table as the y;. Thus (16) and (18) are the conditions 
that the mapping >> yie; — >> zp; be an automorphism A in (P, EZ’, 1). It is 
clear that A leaves the elements of P invariant. Hence A is an inner auto- 
morphism of the form — 6x6". In particular z; = = >> = 
> yuu and so M = B*'8. Conversely if M has this form then (16) and (18) 
hold. 

TuEorEM 12. Let P be an extension of ® such that (P: ®) = n and let E bea 
special regular representation of P over &. Then the conditions (16) and (18) 
oe and sufficient that the non-singular matrix M in P, have the form 

We recall that (16) holds if and only if M is a linear combination of matrices 
a” using coefficients in P. It may be remarked also that separability is not 
required in Theorem 11 since P was arbitrary in Theorem 2. 
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THE AVERAGE MEASURE OF QUADRATIC FORMS WITH GIVEN 
DETERMINANT AND SIGNATURE 


By Cart Lupwia SIEGEL 
(Received May 17, 1944) 


1. Let hg be the number of classes of primitive binary quadratic forms 
Q(z, y) = ax” + bay + cy’ with positive discriminant d = b’ — 4ac; we use the 
narrow definition of equivalence: two quadratic forms are said to belong to the 
same class if they can be transformed into each other by unimodular substitu- 
tions with determinant +1. Let d be no square number, and define eg = 
t+ u/ d)/2, where t, u are the smallest positive integral solutions of 
f-dv = 4 

When d = 4k is divisible by 4, then b is even and Q(z, y) is a properly primitive 
quadratic form of determinant k, in the notation of Gauss. In his Disquisi- 
tiones Arithmeticae Gauss’ ew that the mean value of the expression hy log ex 


is asymptotically equal to k; this means that 
12 


No proof of this statement has been published until now. 

In the first chapter we shall prove the assertion of Gauss in two different 
ways. Both methods may also be used to determine more generally the average 
of ha log €g in any class of residues d = dy (mod m); however, we shall consider 
only the cases m = 4, m = 1, and we shall prove the formula 


3/2 
(2) hy log = N*? + O(N log N). 

The corresponding results for the wide definition of equivalence (unimodular 
substitutions with determinant +1) are an immediate consequence, in virtue 
of the relationship h log « = 2ho log & , where ho denotes the class number in the 
wide sense, ¢ = (t + wor/d)/2, and t, wo are the smallest positive integral 
solutions of § — dui = +4. 


2. In the second chapter we investigate the analogous problem for quadratic 
forms @[r] with m variables and prescribed signature n, m — n, the matrix © 
being integral. Let Tbe a variable real symmetric matrix of signature n,m — n, 
which lies in a domain 7’; let Y denote the domain in the space of all real m-rowed 
matrices 2) which is mapped into T by the condition S[Y] = T; let Yo be a fun- 
damental region in Y with respect to the group of units of S. Denote by 
o(Yo) and v(7’) the Euclidean volumes of Yo and 7’, where the m’ elements of 9) 


'C. F. Gauss, Werke, I (Zweiter Abdruck, Gottingen, 1870); p. 369 and p. 466. 
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and the (m(m + 1))/2 independent elements of T are taken as rectangular 
Cartesian coordinates, and let T converge to the single point S. It is known 
that 


(3) lim v(Yo)/o(T) = p(S) 

exists, except in the trivial case of a rationally decomposable binary form; in 
this case we define p(S) = 0. Our object is the proof of the following 


TurorEeM: Let S run through a system of representatives of all classes of given 
signature n, m — n, whose determinants have the absolute value S < N; then 


In the definite case, i.e., for n = 0 and n = m, this theorem was already given 
by Minkowski,” in a slightly different form. In this case, 


(5) p(S) 1 Il 
E(S) r G) 
2 


where E(S) denotes the order of the group of units of S, and it follows from (4) 
by partial summation that 


2 m 


SSN 


say. On the other hand, E(S) = 2, except when © is equivalent to a matrix 
on the frontier of the reduced domain, and it is easily seen that the sum of the 
class numbers H,,(S) of all positive quadratic forms of m variables and deter- 
minant S < N is asymptotically equal to the left-hand member in (6). Conse- 
quently, 


and this is Minkowski’s formula; vice versa, (4) follows from (7) by partial 
summation. As a matter of fact, Minkowski established a better result than 
(7), namely the estimate O(N”) (m > 2), O(N log N) (m = 2) for the difference 
of both sides in (7). Since the proof of (4) is decidedly more difficult in the 
indefinite case 0 < n < m, we are interested in the simplest appraisal of the 
error term which is sufficient for the proof of the Theorem, and this leads only 
to the estimate o(N) for the difference of both sides in (4). It is possible to ob- 
tain the estimate O(N*) (m > 2), O(N* log N) (m = 2) also in the case of an 
arbitrary signature; however, this is rather complicated for m = 3, and we shall 
derive this more precise result only in case m = 2. 


*H. Minxowsk1, Diskontinuitdtsbereich fiir arithmetische Aquivalenz, Journal fiir die 
reine und angewandte Mathematik 129, pp. 220-274 (1905). 
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I. Brvary Forms 
3. Let x(k) be a non-principal character modulo g and define 


= (n = 1, 2, 
k=1 


Ithas been proved by Pélya® for proper characters, and by Landau‘ ia the general 
case that 

8) | | < log g, 

where c is an absolute constant. Let N be a positive integer; then 


and, by (8), : 


In particular, the Legendre-Jacobi-Kronecker symbol (‘) is a non-principal 


character modulo | d| , whenever d = 0 or 1 (mod 4) and not a square number. 
It is well known that 


(10) log = (d > 0). 
We introduce the abbreviations 


dha log ea = fa, = 


it follows from (9) and (10) that 
(11) | fa oa | log d, 
for all positive d = 0 or 1 (mod 4) which are no squares. On the other hand, 


N 
(12) <1 + log N, 


also when d is a square. 

*G. Pétya, Uber die Verteilung der quadratischen Reste und Nichtreste, Nachrichten 
von der Kéniglichen Gesellschaft der Wissenschaften zu Géttingen, Mathematisch-physi- 
kalische Klasse, Jahrgang’ 1918, pp. 21-29. 

‘E. Lanpav, Abschdtzung von Charaktersummen, Einheiten und Klassenzahlen, Nach- 
tichten von der Kéniglichen Gesellschaft der Wissenschaften zu Gottingen, Mathematisch- 


Physikalische Klasse, Jahrgang 1918, pp. 79-97. 
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| Let r be either 0 or 1; let ¢ run through all numbers of the given residue class 
at t = r (mod 4) which lie in the interval 1 < ¢ < N, and let d run through the set 
if of all those ¢ which are no square numbers. By (11) and (12), 


(13) fa = )+ O(N?" log N) (N > «), 

Define A 
(14) = P,(n) 
and assume first that n is not a square. If n is even, then P,(n) = 0. If n 3 
is odd, then xi(k) = (): is a character modulo n, and not the principal character; a 
consequently, by (8), 


Po(n) = xi(k) = log nO(1). 


Let | denote the highest power of 2 dividing n, then n/l = s is odd and s = 


ters modulo 4n, both different from the ei character, we obtain, again by 
(8), 


Pr(n) = + = log n0(1). 


_ Hence 
(15) P,(n) = log nO(1), 


forr =0,landi <N,n 
Next let n = u’ be asquare. Then P,(n) equals the number of integers in the 
interval 1 < ¢ S N which are prime to u and = r (mod 4). Therefore 


P,(ui) = + O(1) (um odd), 


= N + (uz even). 


It follows that 
+ o(us) + O(log N) 
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By (13), (14), (15), (16), 
+ O(n! log N) 


2r—-1 
¢ )+ O(N? log N). 


Applying partial summation, we infer that 
—1 
(18) d'f, = 36¢@) 2 1) + O(N log N), 


where d runs through all integers in the interval 1 < t S$ N which are = r (mod 4) 
and no squares. Since d'fz = ha log ¢,, the statement (1) follows from (18), 
forr = 0; on the other hand, we get (2) by using (18), for r = 0, 1, and adding 
the two formulas. 


(17) 


4, The analogue of (10) for positive primitive quadratic forms with discrimi- 
nant d is the formula 


where w_3 = 6, w4 = 4, wa = 2 ford < —4. Using exactly the same method 
as in the preceding section, we get 


aN 2r—1 
(20) he = ( )+ O(N log N) (r = 0,1), 
/2 
/2 


corresponding to (17), (1), (2). The second of these three formulas was already 
stated by Gauss, without proof. 

Let Ha denote the number of classes of all positive quadratic forms Q(z, y) = 
ax’ + bey + cy* with integral coefficients a, b, c and discriminant b° — 4ac = d; 
then Hz = >> h, , where ¢ runs through all divisors of d such that dt~ is a square. 

t 


By (22), 
(23) Hs = + log N); 


vice versa, (22) again follows from (23) by an application of the Mébius inversion 
formula. 

There exists in every class of positive quadratic forms Q(z, y) a reduced form, 
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: satisfying the condition |b| < a S c; the reduced form is unique whenever I 
! |b| <a<c. This remark suggests the idea of proving (23) directly, without 
He using the class number formula (19), by a computation of the lattice points in 
; the three-dimensional domain | 7| < — < ¢,0 < 4& — n < T, for sufficiently 
At large 7. In this manner (21) was already proved by Mertens.’ It is possible 
i to apply a corresponding idea also in the indefinite case; this leads to the quad- 
ruple integral J introduced in the next section. 


—_ 


Vf: 5. Let Q = Q(z, y) = ax’ + bay + cy’ be an indefinite quadratic form with 
id arbitrary real coefficients, a ~ 0, b” — 4ac = D> 0. If px, pe denote the roots 
i of the quadratic equation Q(z, 1) = 0, then Q = a(x — pi y)(x — py) and 
if D = a'(o: — px)’. We determine the order of p: , p2 by the condition a(p; — 2) = 

ih +/D, with the positive sign of the square root; then p: > p2 for a > 0. 

if Let \ be a positive parameter and define the positive quadratic form 


(24) P= P(z,y) =|a| — my)” + — = + Bary + yy’. | 


If +r = & + in denotes the root of P(x, 1) = 0 with positive imaginary part, then 


+ = +7) is pure imaginary; consequently, for variable \, these points r 


T — pe 
describe the half-circle H through p; , p2 in the upper half-plane, with the equation 
(25) af +7) 


‘Plainly \ is the tangent of the angle between the diameter from pe to p; and the 
segment from pe to r. 

The given indefinite quadratic form Q is called reduced, when and only when | 
there exists at least one \ > 0 such that P is reduced; this means that 2|8| < 
a Sy, or in other words, that 7 lies in the fundamental domain F of the modular 
group, defined by the inequalities —} < < + 7° = 1. The intersection 
of H and F is an are A; consequently, for every given Q, the set of all \ with 
reduced P is either an interval 4: S \ S d: or empty. We introduce the non- 
Euclidean line element ds = n (dg + dy’)'; then the length of A is 


(26) w= = log 


if A is not empty, and » = 0 otherwise. 


Consider the integral 
[[udadac, 


c with a > Oand 0 < — 4ac = D <1. 


5 F. Mertens, Ueber einige asymptotische Gesetze der Zahlentheorie, Journal fir die 
reine und angewandte Mathematik 77, pp. 289-338 (1874). 
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In view of the definition (26) of u we have 


rerF 


for any given A > O the inner integral designates the volume of the domain 
R, of all a, b, c with a > 0,0 < D < 1 and + lying in F. Instead of a, b, c 
we introduce by (24) the new variables of integration a, 8, y. If » is fixed, then 
P is uniquely determined by Q. On the other hand, it follows from the geo- 
metrical meaning of \ and 7, that the roots p: , p: are uniquely determined by P 
since ay — 8° = Dand a > 0, the indefinite form Q is again uniquely determined 
by P, when ) is fixed. Consequently the domain R, is mapped onto the domain 
Gof all reduced P with ay — 6° <1. This domain G is independent of X. 

By (24), = +A,a = — Am, = X ‘pi + Apr, whence 


moreover = — pp, @ C= pip, 
d(a'b, ac) _|—1—1 
From (27) and (28) we get the value of the Jacobian 


(28) = — pi. 


d(a, B, 7) 
d(a, b,c) —207 +2). 


Therefore 


where V denotes the volume of G. In order to calculate V, we choose the new 
variables of integration D, £, 7, instead of a, 6, y. Since a6 = -t,a y = 
+1, D = ay — 6 = (an), we obtain 


d(a™*B, = —2n,  2and(an) = dD, 


d(é, ») 
(30) v=[ Dt dD % 
By (29) and (30), 
(31) 
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6. The half-circle H passes through F when and only when one of the two 
vertices }(-+1 + ~/—3) of F belongs to the half-circle domain & + 9° + a ‘be + 
a'c $0,720. Therefore a reduced form Q is defined by the condition a + ¢ < 
4|b|, in case a > 0. Since 

(|b| — a)’ + = 3(4a — |b |)’ + = D+ 4a(a + 
we obtain the inequalities 
(92) SD, US4D, 4alc| 
for any reduced Q with a > 0. Moreover, the positive form P is reduced for 
all in the interval 4, S S de, and then 30° = a’ — ay — =D, 


(33) a+») =as24/2 On SA Sm). 


For any ? > 0 let Ps denote the domain of all reduced Q satisfying the con- 
ditions a = 3, D S 1; in view of (32), Ps is closed and bounded; ia (26) and 
(33), the function » is continuous in Po. Define 


Ps 
then 
(34) J =limJ,. 
0-0 
On the other hand, u(ga, gb, gc) = u(a, b, c) for any g ¥ 0; therefore 
(35) Jo = lim N™ b, 


where the summation is carried over the set S; of all reduced Q with integral 
coefficients satisfying the conditions a = 8N, b’ — 4ac = D < N’ 
We consider first all Q in S; whose discriminant D = h’ is a square. Then 
= (b + h)(b — h) and 0 <h N; by (32), a’ 4N’, 0’ S $N’. For any 
pair b, h with b ¥ +h, the number of divisors of (b + h)(b — h) is o(N); on the 
other hand, in case b = +h, we have c = 0 and a’ < 4N’. Consequently the 
number of all these Q is N’o(N) + NO(N) = o(N’*). Since u(a, b, c) is bounded, 
for any fixed #, uniformly with respect to N, it follows that (35) holds good if 
we cancel in S; all Q with D = h’,h = 1,2,---,N. 
Next consider the set So of all reduced Q with integral a, b, c satisfying 0 < 
a <08N,0<D</N’. The points of the arc A attain the minimum of their 


imaginary part 7 at one of the end points of A; let £ + im be this end point. 
On the half-circle H we have 
ah _ dr (p1 — ps)dr 


(01 — px)(2r — _ — —D' dt. 
(7 — pr)(t — — 7) (7 — 7? 


(36) 
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hence 
» 
(36) u(a, b,c) S es 
On the other hand, by (33), 
flog < log? 4/2 < log Sm) 
(37) p(a, 6, c) < 2log 


For any integral & the number of integers a in the interval (#N)/2*" < a < 
(0N)/2* is less than (8N)/2*. Since b’ < (4/3)N’, we obtain, by (37), 


(38) u(a, b, c) = O(N’) OF log = log '0(N’), 

uniformly in #. Moreover, by (25) and (382), 

an = — b& —c 
whence c < N, and 

provided |c| = 2N. In virtue of (32) and (36), 
(39) wa by 0) = O(N") Le 


of <a < ON, 4ac S N’,c 2 Q2N. 
Obviously, - 


N —1 
(40) = O( toe x) = log #'0(N). 
By (38), (39), (40), 
(41) n= » u(a, = blog 


Finally, let S be the set of all reduced Q with integral a, b, c such that the 
discriminant b° — 4ac < N* and not a square number. If Q belongs to S, then 


also -Q; moreover, a ~ 0. By (35) and (41), 


(42) lim sup — N~* u(a, b,c) | = log 


If 0, then log 0 and Js — J, by (84). Bio 6, 
we infer from (31) and (42) that 


(43) lim p(a, b,c) = 2J = 
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7. Let Q be primitive, ie., (a, b, c) = 1. It is well known that the matrix 
M of ever linear transformation z — px + qy, y > rx + sy of Q into itself, with 
integral p, gq, r, s and ps — gr = +1, has the form | 


M = + = 0, +1, +2, --.), 


where t, u is the smallest positive integral solution of  — Du? = 4. The cor- 
responding modular substitution has the fixed points p: , 2 and leaves the half- 
circle H invariant. A fundamental domain on H for the cyclic group of these 
modular substitutions is given by any arc B on H with the non-Euclidean length 
2 log €p, where = 

Let = + + = 1, , g) denote all reduced forms which 
are equivalent, in the narrow sense, with the given primitive form Q, and let A; 
denote the arc A for the particular form Q;. It follows from the definition of 
A and B, that the arcs Ai, --- , Ag are equivalent to certain arcs on the half- 
circle H which cover B without gaps and overlappings. Consequently, 


k=1 
Summing over all primitive Q with given discriminant D, we obtain 
(44) u(a, b, c) = 2hp log €p 


b2—4ac= 
(a,b,c)=1 


where hp denotes the class number. 
By (43) and (44), 


2 
(45) ho log 18 
where D, g run through all positive integral solutions of Dd < NW’ and D is not 
a square. Denoting the left-hand member in (45) by g(N), we get 


1 r n\ 3 

This result differs from (2) only in the estimation of the error term, viz., o(N*) 
instead of O(N’ log NV). It is possible to obtain this better appraisal of the error 
term by defining the parameter 3 of §6 in a suitable way as a function of N. 
In order to prove also formula (1) by our second method, we cancel in the sum 
on the right-hand side of (35) all terms in which either b is odd or a, b, c are all 
even. The remaining triplets a, b, c constitute exactly 3 systems of residue 
classes modulo 2; consequently we have then to multiply the left-hand member 
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in (85) by the factor 3/8. Since b* — 4ac = D = 4kis divisible by 4 and (a, b, c) 
is odd,we obtain instead of (45) the relationship 


where k, q run through all positive integral solutions of 4kq” < N* with odd ¢ 
and k is not a square. It follows that 


r 2n\' 


and this is the formula of Gauss. 


II. Forms or m VARIABLES 


8. Let S be a real symmetric matrix of signature n, m — n, and let the ab- 
solute value of the determinant |@| be S > 0. We denote by H the set of all 
positive real symmetric § which satisfy the condition 5S" = S. It is known 
that H is a manifold of n(m — n) dimensions with the parametric representation 


= - 6 BW = Sf] > 0, 


where ¥ is a variable real matrix with m rows, n columns, rank n and positive 
[x]; two such matrices ¥ = %, , ¥. represent the same point of H, when and 


only when X. = XR with real n-rowed &. 

Let F be the domain of the reduced positive quadratic forms of m variables, 
in the definition of Minkowski. Moreover, let W be a bounded domain in the 
space of the positive quadratic forms of n variables and denote by X the set of 


all which fulfill the two conditions HeF. We define 

po = 1, = iT (3) (k = 1, 2, 
and 


Let & be the matrix formed by the last n rows of ¥ and put 4% = (2) = 
3, S[3] = U, then = 2U-[3’S] — S, U = S[3Z] > 0, and 


where Y is the set of all 9) satisfying 11 > 0, S¢F. This proves that u(S) is 
independent of W. In the particular case of an indefinite binary form ©[r] = 

ari + bay, + cz? it is easily seen that 2u(S) has the value yu in (26), provided F 
is defined with respect to the narrow unimodular group; however, we shall now 
assume that equivalence is understood in the wide sense. 
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We set 

(48) J = dS, 


the integration extended over the space of all © with 0 < S < 1 and given sig- 
nature n,m — n. By (47), 


(49) J { [isis penn ach 


the outer integration is carried-over the space F of all real ¥ with rank n, the 
inner integration is restricted by the three conditions 0 < S < 1, @WeW, SeF, 
for any given % in R. 

By the substitution (46) we introduce the new variable of integration § 
instead of ©, for fixed ¥. Since S¥ = HX, we obtain 


(50) = W'S] §; 


vice versa, (46) follows from (50). Consequently (46) defines a birational map- 
ping of the space of all © with the given signature onto the space of the positive 
. In view of the relationship $5" - GS, we can determine a real matrix 
€ such that S[C] = E and S[C] = CS ext) = ©, say, for any given 


pair S, . This proves that the Jacobian dS/dS = j depends only upon m 
and n; therefore it suffices to compute j at the particular point § = €, S = G. 
It follows from the formula 6¥ = XX that then all elements of the last n rows 


(n) 
of the ned Yared. 22), then 


(G Sz _ (dS, 
$= 257'[Su] — by (46), and = 
for Si = 0, whence j = 
By (49), (50), 


IS1<1 
on the other hand, 


| ay — fm | dk. 
Pm—n 


Consequently, using Minkowski’s formula for the volume of the domain of the 
reduced with | | < 1, we obtain 


61) J=om = Te. 
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9, Let 81 = 8 be the first diagonal element of S = (s,:). For any 3 > 0 
we define 


/ u(S)S*-™* as, 


abs 
0<8<1 
By (48), 
(52) J =limJs. 


The real symmetric matrix © is called reduced whenever there exists a solution 
§of 6S"S = Sin the domain F. If u(S) ¥ 0, then necessarily S is reduced, 
in view of definition (47). Let ,---,hm be the diagonal elements of §; 
since © + S and S — S are non-negative, it follows that st: < huh (k,l = 1, 
If les in F, then hy and the m — 1 ratios 
h»-1/hm are bounded, Consequently, if moreover s° > 3° and 0 < S < 1, then 
§, S and S” are bounded, for any given #, and u«(S) is continuous, by (47). 
Since n(qS) = u(S), for all scalar factors g > 0, we infer from the definition of 


an integral that 


(53) Jo=limN* wos”, 
Ne abs > 


where © runs through all integral symmetric matrices of signature n,m — n 
with abs s > #N”™ and S < N. Performing the passage to the limit 3 — 0, 
we obtain, by (52) and (53), 


(54) J < lim inf > «(S)s*”. 
SSN 
For any given integral S there exist only a finite number of equivalent reduced 
matrices G,,---, ©,. It is known that 
(65) 


where p(G) is defined by (3), except in the case m = 2,n = 1, Sasquare. By 
(51), (54), (55), 


(66) lim inf N* So) = 5 
SSN ka? 
in the left-hand member © runs over a system of representatives of all classes 
with signature n, m — n whose determinants have the absolute values S S$ N. 
In our proof of (56) we have tacitly assumed, in some places, that 0 < n < m. 
However, in the definite case n = 0 or m, formula (56) follows directly from (5), 
(51) and the definition of an integral. 
If it is true that the two passages to the limit N — », # — 0 can be inter- 
changed in the right-hand member of (53), then we should obtain from (53) 
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statement (4) of the Theorem, instead of the much weaker result (56). However, 
the direct proof of the legitimacy of this interchange seems to be very compli- 
cated in the general indefinite case. For binary forms, we overcame this diffi- 
culty in §6 by using the estimates (36) and (37) for »; but the corresponding way 
becomes impractical in case m = 3. In order to get the desired result we shall 
also apply the generalization of the class number formulas (10) and (19), viz., 
the formula for the measure of a genus. Therefore our proof of the Theorem 
will be a combination of the ideas of the two different proofs given in the first 
Chapter. 

The Theorem is trivial in case m = 1; for m = 2, it is easily derived from (17) 
and (20), even with the estimate O(N’ log N) of the error term. For the rest of 
the paper we shall assume that m = 3. 


10. Let S:,--- , SG, denote a complete system of representatives of the 
classes in the genus of © and define i 
h 
(57) = (i). 
It is known that 
(m—1) /2 
58 
(58) v(S) II EAS) 


where p runs through all primes, g = p’‘ is a sufficiently large power of pand E,(S) 
is the order of the group of units of S modulo g. If p’ is the highest power of p 
dividing 2S, then ¢ may be any integer >2b; in particular, for b = 0, i.e., (p, 
2S) = 1, we have the explicit formula 


(m—1)/2 


(1 — IT (l1—p™) (meven), 


_1)\m/2 
where 6 denotes the Legendre symbol (* |S ' 
Let N = 8,8 < N and 
(60) Q = J] 
Pp 


then < 28 < 2N, whence b < EH < | and 
log p log p log p 
Defining 


m(m—1) /2 ra (m—1) /2 
(61) 2p 2 1) 


E,(S) ’ ES) ’ 
where x(N) denotes the number of primes < N, we obtain 


a(S) = 
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by (68), (59) and (60). 
On the other hand, the number of modulo Q incongruent integral m-rowed 
matrices @ with (|@|, Q) = 1is 


(s) A=4e=Q" TI [la 
PSN k=1 


Obviously the ratio A/E 9(S) is the number of symmetric &) modulo Q which 
are equivalent to S modulo Q. For these @ we have 

| Sol =|S|2*(mod Q), Q) = 1; 
it follows that the number of determinants | S) | modulo Q equals the number 
of quadratic residues xz” modulo SQ which are prime to Q; this number has the 
value 
(64) B= 277™s"Q I] 

PSN * 


Consequently the ratio A/BE 9(S) is the number of symmetric @) modulo Q 
fulfilling the two conditions SG ~ S (mod Q), | So | = | S| (mod Q). 

Finally, let S run through a system of representatives of all genera with signa- 
ture n, m — n and given determinant |S |= (—1)” "S. We denote by C,(S) 
the number of residue classes modulo Q which contain an integral symmetric 
matrix with this signature and determinant. By (61), (63), (64), 


11. By the reciprocity formula for Gaussian sums we have 


t(mod Q) 
Define m = n — 2forn = 2and m = n+ 2 forn < 2; thenO S m S m, in 
view of m => 3and0<n<™m. Since = — and (—1)" = (—1)”, it 


follows that two symmetric matrices with the same determinant (—1)” "S and 
the different signatures n,m — n and m , m — no never belong to the same residue 
class modulo Q. Consequently the sum C,,(S) + C,,(S) is at most equal to the 
number D(S) of modulo Q incongruent integral symmetric matrices ©, satisfying 
|= (—1)"""S(mod Q). 

Let S and & run through a system of representatives of all classes of integral 
symmetric matrices of signatures n,m — n and nm, m — no whose determinants 
have an absolute value S < N. Since 


=a+ow'y 


DSN k=2 


We infer from (57), (62), (65) the inequality 


ver, (2) = 2a(S)a(S),  w(S) = 1 + O(N), 4 | 
pli- 
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In the remaining sections we shall prove that 


Plainly the Theorem is an immediate consequence of (56), (66) and (67). 
12. Define 


e(z) (2) @> 0, (9) = 1); 
q S(mod q) 
then 


l=1 S(mod Q) 


aj@ 


abs < Min (w, 
s=1 sin ml 
q 
and 


1 


_ Consequently, in order to prove (67), it suffices to deduce the estimate 
N 


where 8, denotes the maximum of the ¢(qg) numbers abs () g=1, 
g; g) = 1). 
It is easily seen that 
(69) = = 1). 


It remains to determine an upper appraisal of 8, in case g = p(t = 1, 2, --: ). 
Let p { 2a and consider all residue classes of symmetric S modulo p satisfying 
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|| = a(mod p). Any two such © are equivalent modulo p; in virtue of (59) ! | 
and (63), their number has the value a 


Hence 
a() 4. Wall) = 09", 
(70) PBp = O(p’), 


1); for the set of all primes p. 
Next let g = p’,t = 2,b = [(¢ + 1)/2], w = p’; let S, and G run through 
complete systems of residues modulo q and g/q. If Go = (siz) denotes the 
adjoint matrix of ©, , then 


| Si + qoSe | = | Si] + qoo(SoSe) (mod q), 
whence 


The inner sum vanishes, except when g/g is a common divisor of all numbers | + 
Sie, 28u(k, 1 = 1, , m), and then it has the value If the latter 
case occurs exactly F, times, then 


(72) Bo < 


13. Let p*' be the highest power of the prime p dividing all elements of ©; 4 ¥ 
and go; if a = 0, then G; is called primitive modulo gq. Put gp “* = q:; plainly, ae 
&, = p“ where is modulo q; primitive and uniquely determined. Suppose 
a <b,ie.,p!qi.. If all diagonal elements of T = (t:) are divisible by p, then 
there exists an element ty , with & ¥ 1, which is not a multiple of p, and the same 
holds for the principal minor tu — tj. Consequently there exist principal 
minors in € which are not divisible by p; let 7: be the maximal order of these 
minors. After a suitable permutation in the rows and the columns of T we 
obtain 


| (73) TG] = 


where ©, is the matrix of the permutation, T, = {?,p X |G]. Let Abean 
integral matrix satisfying T,% = € (mod gq) and set 


then 


| 
| 
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(75) C2] = p™ (mod q) 


with integral 

We denote by p” (1 = 1, --- , m) the elementary divisors of ©; modulo q, 
such that 0 S w, < --- S wm S b; moreover, let a, --- , a, be the different 
values of the w:, where 0 S a < a, < ++: < a S b, and denote by r; the mul- 
tiplicity of a in the sequence w, --* , wm; plainly, a; and 7; have their former 
meaning. The (m — r;)-rowed matrix T) has modulo gq; the elementary divisors 
=n+1,--:-, m). 

Let F(qo; w1, @m) be the number of modulo incongruent with given 
w1,°** @m, and put w = 0,7: = w — wa (l = We shall prove 
by induction that 


where c, depends only upon m. The assertion (76) is empty in case m = 0. 
Using (76) for m — 7, qi instead of m, go, we obtain 


where 

MED, 


moreover, the number of modulo q, incongruent (3,Ti2) in (73) is at most 
gi” “» the number of permutation matrices ©, is m!. By (73), (74), (75), 
(77), (78), 


go (Qo; O1,°*"* Wm) < m! s Cap’; 
with cn = m! Max cm_,,; and this is the assertion (76). 

Suppose now that Go = (s,) is the adjoint matrix of ©, and that ¢/q = 
p’ ’ is a common factor of all numbers sy, , 28x (k, 1 = 1, ---, m); then, a for- 
tiori, 2p” "Sp is integral; hence t — b + --- + wma for p ¥ 2andit—b<s 
@ + +++ + @m.a + 1 for p = 2. On the other hand, 


and Ul + 1) — 6(1 — 1) = (l— 2)(1-— 3) 2 0 (1 = 1, --- , m), where the 
equality sign holds only for 1 = 2, 3; therefore w, + --- + wm1 < 4, with 
the equality sign only when y; = 0 for 1 ¥ m — 1, m — 2. It follows that 
3 
(79) (*) Bg 


in case t = 3, p ¥ 2, we have 
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1 
(80) 


exept when y: = for 1 # m — 1, m — 2and ym4 + 2ym-2 = 1. Plainly this 
exception is the case w = O(/ = 1, --- ,m — 2), wma = wm = 1. is 


14. The number of systems of integers w , --- , wm satisfying 0 < w < -:- 
Sum < bis < (0 +1)". By (71), (72), (76), (79) and (80), we obtain the in- 


equalities 
(61) qBe < = = 2,3, | 
(82) < O(p™) abs €q (|G |) (q = 


where ©, runs through all symmetric matrices modulo p’ with w, = = = 
0, ®m-1 > 0. In order to estimate this last sum, we put S, = #1 + p92, where | 
f,runs over all residue classes modulo p and §; over all residue classes mod- 
wlo p with = 0, wm1 = 1. The number of these is 
by (76). Let 9: be given and determine a unimodular ©, according to (75), ty 


(m—2) 
such that = (mod p’), = (5 4 with integral a,b. Put 


(mod p'), p | Ei], | We + = @+ a)(e +b) — Obviously, 


except when 2 = —b (mod p). If 9%: is given, then the number of 9: with z = | 


-b (mod p) is Consequently, 

and 
by (82). 

It follows from (70), (81), (83) that 
(84) = O(p’) + yo) = O(p"’). 


In view of (60), (69), (84), 

om). O(log log N) 
Ye a+0p%) =e =e 
PSN 
o(1) N 
= (log N) 
and this is the assertion (68). The proof of the Theorem is now complete. 
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GROUP INVARIANCE OF CAUCHY’S FORMULA IN SEVERAL 
VARIABLES 


By S. Bocuner 
(Received February 1, 1944) 
1. If we iterate Cauchy’s formula 
(1) fe) = [ 


c 
we obtain 


In (2), C;, is a k-dimensional “characteristic manifold” of a meee topological 
type, its prototype being the torus 


(3) 1, Se] = 1. 


The special nature of this manifold corresponds to the special nature of the kernel | 


1 
— 21) +++ (Se — 


under the integral, and S. Bergmann’ and A. Weil’ have studied alterations of 
the formula to be brought about by suitably modifying both C; and G(f; z). 

In the present paper we will deal with the same question but from a different 
approach. In the work of Bergmann and Weil the emphasis is put on obtaining 
a formula to suit a prescribed domain of validity in z, whereas our emphasis 
will be on retaining prescribed structural features of the formula as such. We 
note that in the special case (4), the kernel depends only on the differences 
$1 — 4,°*:, %% — &, and this feature will be retained unaltered. We further 
note that the torus (3) is invariant and transitive under the group of trans- 
formations 


(4) Gg; z) = 


This feature will be likewise retained but other groups of affine transformations 
will be admitted. 
Part I will deal with the group of translations, Part II with some cases of the 
unitary group, and Part III with general groups. The general case will be based 


1 On the surface integrals of functions of two complex variables, Amer. Jour. of Math. 63 
(1941), 295-318. 


* L’integrale de Cauchy et les fonctions de plusieurs variables, Math. ‘Asin, 111 (1935), 
178-182. 
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on an operational version of Cauchy’s theorem in which no reference is made to 
“realization” by integrals. 


Parr I 
OcTANT-SHAPED DoMAINS 
2. Tubes 
Given any k complex variables 
(5) Zp = Lp + typ; p=1,-::,k, 


we consider a k-dimensional domain P in the space of the real variables x = 
(1, °**, %), and with the point set P we construct in the (2k)-dimensional 
space of the variables (5) the domain 

(6) <yp< ~,p=1,-:-, k}. 

Such a domain has been called a tube,* and the point set P its base. A tube will 
be usually denoted by the letter 7. We will say that the domain P is radiated 
if it is convex, does not contain the origin « = (0, --- , 0) but is composed of 
rays which emanate from the origin. That is, if z° = (21, --- , zg) is a point of 
P, then the segment 

(7) O<t<o 


also belongs to P. We now pick another set of & real variables denoting them 
by a = (a1, +++ , a.) and we consider all those a for which the inequality 


(8) + + apt, > 


holds for all x ¢ P.. They obviously form a convex set. We make the explicit 
assumption that the set is not empty and that it actually has an interior thus 
being the closure of a domain. This (open) domain will be denoted by G and 
called the conjugate to P. The domain G is again radiated. We now set up 
the integral 

G 


where dv, = da, «++ do, and we make the final explicit assumption that this 
integral is boundedly convergent in every compact subset of P. If all these 
assumptions are fulfilled we will say that the tube T is octant-shaped, and we 
associate with it the “kernel” 


(10) K(a, %) = dv. 


which is obviously defined and analytic in all of 7. 


*§, Bocuner, Bounded analytic functions in several variables and multiple Laplace inte- 
grals, Amer. Jour. of Math. 59 (1937), 732-738. 
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The boundary of T is likewise ‘‘tubular’’, and the most important part (for us) 
of the boundary will be the pointset 
(11) {tp =O + tmp; < =1,---, kh}. 


It will be called the spine of T. The Euclidean volume element on the spine is, 
of course, dun = dm --- dm, ; however there will be a formal advantage in using 
not the volume element as such but the “‘oriented’”’ external differential 


(12) df +++ dt, = idm +++ dm = 


and whenever this differential will occur under the integral, the spine as a mani- 
fold will be denoted by 7H; . 


3. Functions of integrable square* 


Functions in tubes can be subjected to Fourier analysis. If f(z) is analytic 
in T then we say that it is of integrable square, or belongs to Lz, , if there exists 
a finite constant M = sys such that 


(13) aap +) |? doy 


for all x in P (and not only for every compact subset of P). If (13) holds then 


f(x + ty), for each x, can be represented as a Fourier integral in y, and these 
representations coalesce into an absolutely convergent Laplace-integral 


The function g(a) is measurable, and by the equality of Parseval-Plancherel 
we have 


15) ff Lola) = + a) 


therefore 
(16) [ele ane S My. 


If we now recall the definition of the conjugate set G, and if we realize that the 
Lebesgue measure of the boundary of G is zero (since G is convex), then a simple 
argument will show that the function g(a) must be zero, almost everywhere, 
outside the conjugate domain G. Thus, for instance, we can rewrite relations 
(14) and (16) into 


(17) fe) = o(a) dow 


and 


| 
(18) 
holds 
| (19) 
By 
(20) 
(21) 
| | 
| | (22 
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(1) | o(a) | < 


Since z = (0, «-- , 0) is a boundary point of P and for all points of P relation (8) 
holds, wé may now derive from (18) the limit relations 


[ don My 
and 
20) tim | ele) [2 dng = 0. 
The first implies the existence of a function of integrable square 
a1) Hin) ~ de 
G 
and the second establishes the limit relation 
(22) tin + in) |? do, = 0. 


Thus our function f(z) has automatically generalized boundary values (in the 
strong topology of Lz) almost everywhere on the spine of 7’; aiso, these boundary 
values are identical with the limit-values 


lim f(z + in) 


whenever such limit values exist on a measurable set of the spine. 
Now, for each z in 7’, K(z — in) is a function of integrable square in », namely 


(23) — in) ~ dy, 

By the theory of Fourier convolutions (resultants), relations (21) and (23) imply 
the relation 


If we combine this with (17) and recall (12), we finally obtain the following 


theorem. 
TuzorEM 1. Jf T is an octant-shaped tube and K(z) is its kernel as defined by 


(10), then for functions f(z) of class L, we have the formula 


for all z in T. 
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The most obvious case of an octant-shaped domain is the tube 
Its basis is ‘‘self-conjugate” and the kernal is 1/z, --- z,._ However, a more sig- 
nificant type of domain, also self-conjugate, has come to the fore recently and 
we will discuss it in some detail. 


4. Matrices‘ 


If & = n’, then the variables 
(26) 
can be re-indexed into a square scheme 


(27) 2pq » 


The space of these independent complex variables will be called the non-sym- 
metric (matrix) space. The letter z will sometimes denote the set of variables 
in either indexing, but most of the time it will denote the matrix (27) as an 
algebraic concept. 

If the matrix is symmetric, zp, = 2,» , then we choose the elements 


2115 125 °** » Zin 
(28) Zany Zan 


Znn 


as independent complex variables, and the letter z will again denote either the 
set of these variables or the resulting matrix. In this case k = n(n + 1)/2, 
and the space will be called the symmetric space. 

We will require a lemma. 


Lemma 1. In non-symmetric space, if a and b are arbitrary constant matrices, 
then the Jacobian 


_ Wr) 
+++ , 2x) 


of the linear transformation 


(29) a w = bea 
has the value 
(30) J = (det | ba] )”. 


In symmetric space, if a is an arbitrary matrix and a’ is its transposed, then the 


_ Jacobian of the transformation 


4 See C. L. S1eGe., Symplectic geometry, Amer. Jour. of Math. 65 (1943), 8-10. 
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(31) w = a’za 
has the value 
(32) (det | 


Proor. Since the Jacobian of successive transformations is multiplicative, 
it is sufficient to verify (30) for the special cases: a = 1 and b arbitrary, or b = 1 
and a arbitrary, and we will only verify the second case. Thus we have to prove 
that 
(33) J(a) = (det | 
for the transformation w = za. In the first place, (33) can be verified by direct 

computation for a diagonal matrix a. In the second place, we have J(b™'cb) = 

J(")J(e)J(b) = J(c), and thirdly we have det | b-'cb | = det |c|. Thus, (33) 
holds for any a of the form b™'cb with c diagonal. The general case follows now 
by analytic continuation in the elements of the matrix a. 

In the symmetric case we have to assume b = a’ in order to make sure that a 
symmetric z will transform into a symmetric w. The proof proceeds along the 
same lines as before. 

5. The symmetric case 


We write z = x + iy, where x = {2,,} is the matrix of the real parts and y = 
{Yn} is the matrix of the imaginary parts of the variables (28). The matrices 
cand y are real and symmetric, and hence hermitian. We now consider the set 
of those matrices x whose roots are all strictly positive, in symbols x > 0. The 
corresponding pointset will hence forth be denoted by P%, and the tube with P°, 
as base will be denoted by 7%,. It is easily seen that P%, is a radiated set. We 
now take another real symmetric matrix 8 = {8,,} in which again (n(n + 1))/2 
elements are variable, and we set up the familiar expression 


(34) tr(zg) = « 


In the double sum each non-diagonal term occurs twice, and it can be written 
as ta, + -++ + aan where each a; is a Bp, or double a Bp, ; also 

(35) = dog. 

Relation (8) states that tr(zé) > 0; and the latter will hold for all z > 0 if and 
only if 8 > O (as a matrix). Thus the conjugate domain G consist again of all 
positive matrices, and we will simply denote it likewise by P*,. For the kernel, 
if any, we now have the equation 

(36) Kia) = dy, 


This integral occurs in the work of Siegel.° He actually considers the more 
general integral 


°C. L. Sreee., Ueber die analytische Theorie der quadratischen Formen, Annals of Math. 
36 (1935), 585, Hilfsatz 37. 
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and he shows that it has the value 
(38) (det | II 


(We will indicate the process of evaluation in the discussion of the non-symmetric 
case.) If we now puts = (n + 1)/2, then Theorem | implies the following theo- 
rem: \, 

TuEoreEM 2. In the symmetric case we have the formula 


+++ dg 
where 


2 3 n+1 
(40) GPG) 
a= On 
for functions of integrable square in T", . 
We remark that for even n, the value of the denominator 
(det, | — |)” 
is to be obtained by analytic continuation, which is easily proven to be unique, 
from its initial value +1, for z = 1 (matrix), ¢ = 0 (matrix). 


6. The non-symmetric case 


The analysis is now more involved than in the symmetric case, however the 
final formula will be even simpler in structure. We will employ two systems of 
variables which we will denote by {z,,} and {Z,,} respectively. They shall be 
connected by the affine transformation 


Zop = p=1,:::,k, 


If we introduce the real and imaginary parts of all variables, zpg = pq + 1Yp05 
= Xa + then we have the relations 
Xpp = Yop = Yop 


Xqp = — Yo Vo = 
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where again p <q. The coordinates z,, will be called primary coordinates, and 
the coordinates Z,, will be called secondary coordinates, and we will employ both 
concurrently. 
We introduce a third system of quantities in terms of the primary coordinates. 
If z is a matrix and if z* is the conjugate-complex transposed matrix (“adjoint” 
in Hilbert Space terminology), then we put. 


Hence 
he 
2 
or 
Trp = 


= + i Xep 


The matrix r is hermitian. Furthermore if 
(41) M2, Tn, ) 
is an analytic expression in the quantities r,,; if we view the quantities {r,,} as 
dependent on {X,,}; and if we then perform the analytic continuation of replac- 
ing each X pg by Zq; then the function (41) will go over into 

, + 12m, Ze iZn , 
which in its turn is 
(42) glen, 22,21, °** 
In other words, if we consider a domain which is a tube in terms of secondary 
coordinates, then the transition from (41) to (42) is the continuation of the func- 


t‘on ¢ from the base of the tube into the tube proper. 
Such a tube will be now defined. It shall consist of those points of the space 


for which all roots of the matrix r are strictly positive, r > 0. Its basis (in terms 


of the components X,,) will be denoted by P,, and the resulting tube, which 
can also be described as 


reP,; < Vy < 


will be denoted by 7, . It is easily seen that P, is radiated, and the computa- 
tions to follow will imply that 7, is octant-shaped. In order to describe the 
conjugate set G we have to consider the expression 


(43) 8) = Bow 
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for an arbitrary real matrix 6. If we put 
App = Bop 
gg 

(44) : 

2 

then (43) is identical with 

(45) tr(ra) = . 


For fixed a, if a is diagonal, then tr(ra) will be positive for all positive r, if and 
only if, a itself is positive. For a non-diagonal we can put a = ua’u' where 
a’ is diagonal and u is unitary, and the relations 

tr(ra) = tr(rua’u™) = tr({u rule’) 
will lead to the same conclusion. Thus we see that G is a domain in the 6-vari- 
ables which in terms of the auxiliary hermitian matrix a is again described by 


a> 0. This domain will be denoted specifically by G, . 
As for the kernel of 7, , its value in P, is given by the integral 


(46) = [ ay, 


and, by our previous remark, its value in primary coordinates of 7, will be K(z). 
Suppose first that r is diagonal, rp, = Ap} Tp = 0, p ~ g. The transformation 


Apq 
WV pda Pp $ q 
or rather the corresponding transformation in the 6-variables, transforms G, 


into itself, and replaces dug by (dvg)/((A. An)".) Since An = det |r| 
and app = pp, we therefore obtain, for r diagonal, 


an 
(47) K(r) = (det [r})*? 
where 
(48) a, = re 


If r = ur’u’, where r° is diagonal and u is unitary, we replace in the integral, a 
by wau™*. On the basis of Lemma 1 in connection with the identity tr{ur’u'a} = 
tr{r°x‘au} it is now possible to verify that K(r) = K(r°). Since we also have 
det |r| = det|r°|, we may thus conclude that (47) holds for an arbitrary 
hermitian matrix r. In order to compute the constant a, , it is advisable to set 
up the more general expression 
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as) "(det dp, 


for s 2 n, and to prove the inductive relation rea | 
(60) = — n + 1)An-a(s) 
forn 2 2. The coordinates | 

(61) Bee; = 2,3,---,n 


are subject to the condition det | as. | > 0, and their totality describes a point- 
set that may be denoted by G,... If the quantities (51) are fixed, then the re- 


Jinates 
(62) Bu; By, Bas = 2, 
are variables subject to the restriction he 
(53) det © > 0, 
Bip — 
2 ’ pe 


and we are going to integrate (49) with respect to the variables (52) first. In 
order to simplify the computation we will first assume that the matrix | ape | 
ro 


is a diagonal matrix and the value of its determinant will be denoted 


by D. Since ér(8) = Bu + Bez + --: we thus have to compute the integral 


(54) pf du TT diy dB, 


the integration extending over the point set 
Bip + Bo 

pa > 0. 
p=2 4n, 


(55) Bu > 0, 
In terms of new coordinates 


_ Br 
B= Bu, = 4/4), Bu 


the integral is 
4" pr / - + i) ap II dé, dn, 


and the point set is 


n—1 ik 


6>0, 
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A direct computation gives the value 
D*"*(4r)""T(s — n + 1), 


and a re-check will show that we obtain the same value for non-diagonal matrices 
| Qe |. This completes the proof of (50). Since Ai(s) = I'(s), we uted obtain 


(56) A,(s) = — m+ 
and in particular | 


If we insert (47) in (25) we obtain the integral 


(2mi)™* Jiz, (det |Zp¢ — 
where fp, are primary and Z’, are secondary coordinates on the spine. Finally, 
if we replace secondary coordinates by primary coordinates even on the spine, 
we obtain the following theorem. 
THEOREM 3. 


f 
for functions in T,, which are of integrable square relative to secondary coordinates; 
the integration extending over the spine of T,, . 
In Part II we will transfer formulas (39) and (58) from the tube to a type of 


unit sphere, thus bringing them much closer to being Cauchy formulas, and in 
Part III we will make an analogous investigation for the general formula (25). 


Part II 


SPHERE-SHAPED DoMAINS 
7. The Cayley transformation 


If z is any matrix and 2* is the adjoint matrix, then the products zz* and z*z 
are non-negative hermitian and, although not necessarily equal, they have the 
same roots.’ The set of points z for which zz* < 1, (or z*z < 1) will be called 
the “unit sphere’. It will be denoted by S, in the non-symmetric, and by Sp 
in the case. explicitly, {zp,} belongs to S, (or if for every 
vector An) ¥ (0,--- , 0), 


< 


or if 
< 
* Compare P. R. Haumos, Finite dimensional vector spaces, Princeton (1942) 138-140. 
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Unit-matrices ¢ are those for which ¢¢* = 1. Thus if U, (or U%, in the symmetric 


of 
Wenow claim that for we have 
(59) det | #0... 


Since — 2 = (1 — and = < 1 it is sufficient 
to show that det | 1 — O for-z¢S,° If this were false, there would exist 
a vector \ * O for which 4, = 2A). On multiplying this relation with its 
conjugate complex, we would obtain A,A, = %,2-gAp, Which is impossible for 
zinS,. Since ¢ = —1 is unitary, we see that det | 1 ats) ¥ 0, esol opel 
(60) | +2 
(“Cayley transformation”). Obviously 
wtwt=(1+2) "(1-2 2)" 
=(l+2) — 2+) + + 
= 2(1 + 2) “(1 — 2e*)(1 + 24)". 
Now, in general, if p is positive hermitian and gq is arbitrary, then gpq* is again 
positive hermitian. Thus, 1 — zz* > 0 implies w + w* > 0, and the latter in- 
equality characterizes a point of the tube 7’, in primary coordinates. . 
Conversely, let we 7',. We certainly have det|1 + w| + 0. Otherwise 
we would have a relation \, = — W,pA»y which in its turn would lead to 


= — (Wye + Dep)ApAg - 
This however contradicts w + w* > 0. Thus we can set up the quotient 
1+w 
which is the formal converse to (60), and we obtain 
l—2z*=1- (1+ (1 — w)(l — w*)(1 + 
= (1+ + w)(1 + w*) — (1 — — w*)} + 
= 21 + w) fw + w*} (1 + wt). 


This implies 1 — zz* > 0. Hence we have proven 

Lemma 2, The Cayley transformation is a one-to-one map between S, and T,; 
or between S°, and T*,. 

Now, if ¢ is any point of Us, and if det |1 + ¢| x 0, then 


i+¢ 


(61) 


(62) o = 


t 
ices 
| 
ine, 
| i 
les; 
of 
in 
5). 
| 
|. 
| 
hy 
bid 


is a point of the spine 7Z;,. Conversely, if w is any point whatsoever of iE, , 
then ¢ = (1 — w)/(1 +) isa point of U,. Thus, if we denote by A; the sub- 
set of U, which is characterized by det | 1 + ¢| = 0, then (62) is a one-to-one 
map between V; = U, — AiandiE,. More generally, take any fixed number p of 
absolute value 1, denote by A, the subset of U, for which det |1 + pt| = 0, 
and put V, = U, — A, then the transformation 


is a one-to-one map between V, Now, if po = 
different values of p then {V,,, V,,,-::, Von} is & complete covering of U,, 
since the polynomial det | 1 + pf | cannot have more than n + 1 roots». Fi- 
nally, the intersection of A, with any V, is the image in U,, of the pointset 


11l-w 

det |1 +2 =0 
in iE; , and this is an analytic manifold of dimension =< k — 1. Thus, U, (and 
similarly U%,) is a k-dimensional analytic manifold and we have 

Lemma 3. The Cayley transformation is a one-to-one map between U, — A and 
iE, , where A is a lower-dimensional analytical subset of U, ; similarly, between 
U* — A° and ik. 

Finally, we will state a last pre-requisite. 

Lemma 4. The Jacobian 3(z)/d(w) of the transformation (61) is 


1 
(et 


(—2)" 


in the non-symmetric case, and 


1 
+ 


in the symmetric case. 
In fact, from 2(1 + w) = 1 — w we obtain successively 
dz-(1 + w) + z-dw = — dw 
dz-(1 + w) = —(1 + z)-dw 
dz = —(1 + z)-dw-(1 + w)" 
—2(1 + w)™*-dw-(1 + 
and Lemma 4 follows now from Lemma 1. 


8. The non-symmetric case 


We note that U,, , being a group space, is orientable. Therefore, the differen- 
tial df, --- dg, can be defined in the large, and its algebraic sign can be changed 
by a change of orientation. 
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TuorEM 4. Jf f(z) ts analytic and bounded in S, and has boundary values in 
the ordinary sense at all points of U,, , then the formula 

@) He) = Jy, det = 


is valid for z in S,. 
If we apply the transformations (61) and (62), we have by Lemma 4, 


and the relation 
= 2(1 + w) — + 
implies the equality 
(det | ¢ — 2|)" = 2™ (det | w — w| )*(det | (1 + + w) |). 


Also the exceptional subset A of U, , which is not affected by the transforma- 
tion, can be neglected in the integral (63) on the basis of Lemma 3. Thus, 
formula (63), whether true or false, is equivalent with formula 


F(c) don +++ de 
where 
S(z(w)) 
(66) PO) = Get + wp 
and 
1!2!--- — 1)! 
(66) Cn = 


By assumption, f(z(w)) is bounded; therefore F(w) is of integrable square in T’, , 
and its boundary values on the spine derive from the values f(¢) on U,. Thus 
(64) is true by Theorem 3, and the proof of Theorem 4 is completed. 


9. Group-invariant volume 
We note that in non-symmetric space, by Lemma 1, the differential form 


Us 
6 po: 


is invariant for arbitrary transformations ¢ — afb. For o unitary, ¢ + of is 
an automorphism of U, as group space and we can now deduce that (67) is the 


, 
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volume element of the group invariant volume on U,. In fact, we can define 
the additive set function 


df, +++ dt 
u(B) = Cp Get 


for Borel sets on U,,. It is group invariant, and it is absolutely continuous relg- 

tive to the group volume, therefore by. a general theorem’ it is a constant mul- 

tiple of the volume. Also »(U,) = 1, and thus it is identical with the volume. 

We will now denote (67) by. du, dnd since ¢ — 2.= (1 — z g*)f we nowobtain 
TurorEm 5. Under the assumptions of Theorem 4 we have 


JQ) d 


where dv is the group invariant volume element of U,, , with the entire group having 
volume 1. 
If we introduce the kernel 


(69) §) = det | 1 — 
then formula (68) reads 
(70) fe) = Hee; ao. 


The kernel (69) is analytic in 2, ---, ze; fi, --+, & and has a power series 
expansion 


71) =X Holes 

where each H,(z; §) is a homogeneous polynomial (with real coefficients) of 
dimension p in either z or §. Also H(z; §) = H(§; z) and therefore 

(72) H,(z; §) = H,(§; z). 


It is easily seen that the series (71) is uniformly convergent for ¢ varying in the 
closure of S, and z varying in some neighborhood of the origin; (actually, z 
may vary in any compact subset of S,). On substituting (71) in (70) we there- 
fore obtain 


(73) = Xf) 
p=0 
where 
(74) = Hyle; do. 


78. Bocunur, Additive set functions on groups, Annals of Math. 40 (1939), 787, Theorem 3. 
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By (74), f(z) must be of dimension p or identically 0, therefore the series (73) 
is the diagonal form of the power series of f(z). From this we conclude that 


0,p#q 
(75) zc NAG) dy = 


if we multiply both sides by g,(z) and recall (72), and integrate with respect 
toz, then we obtain the following theorem on unitary groups. 

TueorEM 6. Jf gp(f) and f,(¢) are homogeneous polynomials of different 
dimensions then 


ao) dv = 0. 
Also, if {g-(¢)} ts a complex system of orthonormal polynomials then 
m) H(e; 8) = 


the latter series being summed diagonally. 
As for series (77), since {y,(¢)} is a complete system of homogeneous poly- 
nomials, there exists an expansion, diagonally convergent, of the form 


with suitable polynomials ¢,(z). Their ee with ¢,(z) follows now from (70) 
if applied to f(z) = ¢,(2). 


10. The symmetric case 
This case has a complication.. We do not know whether the manifold U%, , 
which is not a group space is orientable, and therefore we are not justified in 
setting up the differential dg, --- dt, for the entire manifold. Nevertheless we 
can set it up for the manifold V°, = U°, — A° where A’ is the exceptional subset 
occurring in Lemma 3. Thus, in analogy to (63), we can now obtain the formula 


where c, is the constant (40). However there is an exact analogue to formula 
(68). It is the formula — 


f(s) dv 
(79) f(z) = [ (det — |) 
where formally, 


” (det 
In order to justify the analogy, we consider the group of affine transformations 
(81) wgu, 
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where wu is an arbitrary (non-symmetric) unitary transformation and wu’ is its 
transposed. These transformations are a transitive group of motions, and the 
existence of such a group leads to many properties similar to those of a group 
space proper. In particular there exists a group-invariant volume, and it is 
again unique, thus leading to formula (79). We will not insist on details of the 
proof. Obviously Theorem 6 has also a strict analogue in the symmetric case, 


Parr III 
GENERAL GrouP INVARIANT DoMAINs. 
11. An abstract theorem 


Let G denote a compact group of affine transformation operating on a fixed 
set of variables t = (t,,-+- , %). An element of G will be denoted by the letter 
o, and the actual transformation by ¢ — of. The transposed transformation 


_ will be denoted by o’. Thus, if (¢, u) is the inner product tm + --- + ty, 


then (ct, u) = (t, o’u). We term G a comprehensive group, if any polynomial 
in ¢ which is absolutely invariant for all transposed transformations o’ is of 
necessity a constant. For instance, the unitary group U, is a comprehensive 
group. In fact, if an element of this group is denoted by ¢ and if P(é) is a poly- 
nomial in the set of variables ¢ = {t,,}, and if P(¢) is group invariant, then 


(82) = P&) PO 
is identically 0 for ¢ « U,.. We now consider the function 
(83) S(@;t) = P(@t) — 


for an arbitrary (non-unitary) matrix z, and fixed numerical values of ¢t. Being 
a polynomial in z, we can apply to it Theorem 3, and since its boundary values 
on U,, are zero, it vanishes identically in z, that is P(zt) = P(t). If we putt = 1 
(unit matrix) we obtain 


P(z) = P(1) = constant, 


thus proving that the unitary group is comprehensive. In the case of the unitary 
group the property just stated is part of the so-called “unitarian trick,’”® and our 
concept of a comprehensive group is a generalization of this property, by explicit 
definition, to general affine groups. We will now state a theorem in which this 
concept has a vital part. ' 
TuEorREM 7. If a distributive operation from (all) polynomials to polynomials 
is commutative with partial differentiation and invariant under the transformations 
of a comprehensive group, then the operation is a constant multiple of the identity. 
If we denote the operation by 


(84) F(x) = Lf(é) 
then the assumptions are 


*H. Weyt, The classical groups, Princeton (1943), 177, Lemma (7.1.A). 
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(85) L(af + bg) = alf + blg 
op 

(86) iz, F(z) pe=l,-,k, 
(87) F(ox) = Lf(ct); 
and the conclusion is 
(88) F(x) = cof(x), 
where ¢o is a constant. We denote the polynomial L(é?' - - - ¢?*) by 
(89) 
and we introduce the formal power series 
(90) F(z; 1) = +++ 


Pil +++ pe! 
Assumption (86) is equivalent with the system of relations 


oF = eee 
(91) az, tpF; 


If we introduce the new expression G(x; t) = ¢ *'” F(z, t), then these relations 
ae = Theweteie. Gia: #) in x, and. thus we obtain for the 
system of polynomials (89) the generating function 

(92) F(z; t) = e*A(h, 


where A(t) is a formal power series, as yet unrestricted. In order to apply 
the critical assumption (87) we introduce in addition to the series (90) the 
formal series 


and after extending our operation to formal power series with polynomials as 
coefficients we obviously have 


F(x; t) = 
By including assumption (87) we obtain the chain of equalities 
A(t) = Flox; t) = Le“? = Le"? = F(a; o't) = A(o't), 
and hence the relation 
A(t) = A(o’t). 


Since the group is comprehensive, this relation can hold only if the power series 
A(t) reduces to a constant ¢, that is F(z, t) = we”. Therefore, 
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and this is the conclusion (88). We observe that there is nothing in our assump- 
tions to prevent the constant co from having the value zero. 

A seemingly very special type of distributive operations having property 
(86) is given by the expression 


F@) = Die + Mim, 
where {£”} are fixed points, and {yu} are fixed constants. However a transition 
from sums to Stieltjes integrals will lead from the special type to what is well- 


nigh the most general type. If C is any bounded Borel set and w(A) is a com- 
pletely additive set function of its Borel subsets, then the expression 


(94) F(a) = | fe + 8) 


will obviously define a distributive operation having property (86). It will be 


easily seen that in order to obtain property (87) it is sufficient to assume that 
the set C and the set function w(A) shall be group invariant; that is oC = C, 
and w(oA) = w(A). 

An important type of Stieltjes integrand is generated by an external dif- 
ferential form. We assume that C is an orientable compact m-dimensional 
differentiable manifold C,, ,0 < m < k, and we assume that a differential form 


(95) dw(t) = dip, dé, 

is defined in a neighborhood D of C,,. The set function w(A) is then defined 

as / dw(é). The set function will be group invariant if the manifold C and the 
A 


differential form are group invariant. This being so, if the group G is a compre- 
hensive group, then the conclusion of Theorem 7, if written explicitly, is the 
formula 


We will also write it more succinctly in the form 


(97) = cafla). 


The formula is valid for the class of polynomials and in most cases, for no class 
of function which is substantially larger. It is easy to state a further condition 
under which the constant c is ~ 0; the condition being that the group @ shall 
be transitive on C,,. In fact, if G is transitive, the set function w(A) must be a 
numerical multiple of a group invariant volume, me co is this multiple, thus 
being different from zero. 


| 
then 
by 
(98) 
cu 
The 
stan 
| ed 
i 
Stol 
(10 
Nov 
(rea 
i fun 
fere 
‘ip 
| 
of 
| 
fy 


mn 


CAUCHY’S FORMULA IN SEVERAL VARIABLES 705 


12. Cauchy’s formula 


If, for a fixed point x, we replace the variable ¢ in (97) by the variable & — a 
then the manifold C’,,, goes over into a translated manifold which we will denote 
by Cn(x), and formula (97) goes over into 


The integrand in formula (98) has the shape we are aiming at, however the 
contour C'»(z) is dependent on x; whereas in a ‘‘Cauchy formula” as we under- 
stand the concept the contour must be independent of zx. We are thus naturally 
led to seeking conditions under which 


aoe - 9 =0. 


By assumption, the differential form (95) is defined in some neighborhood D of 
(,,. Hence there exists a neighborhood N of the origin, such that for z e N, 
(,(z) is also situated in D. It is easily seen that C,,(z) — C is the oriented 
boundary of an (m + 1)-dimensional complex B in D. If by the theorem of 
Stokes, we transform the left side of wi into an integral over B, then the relation 
(99) will hold provided 5 
(100) df — + f:(dwl§ — z))’ = 0. 
Now, if the latter relation were to hold for all polynomials f(é , --- , &) in the 
(real) variables & , --- , &, then it would hold locally for general differentiable 
functions f(¢) and this would imply that the differential form (95) is identically 
0. In other words there exists no Cauchy formula in real variables. 

However the situation is different for complex variables and the reader will 


have no difficulties in proving the following theorem. 
TueoreM 8. If , are complex variables and , are conjugate 


complex variables; if 0 < m < k; if the tensor 

is defined and differentiable in the neighborhood D of a compact orientable dif- 
ferentiable manifold Cr4m of dimension k + m; if the external derivative of the 
differential form 
(101) dw(¢; f) = dt K5,..-20(85 df», dy, 


is identically zero; and if the manifold ht the differential form are invariant under 
the transformations 
— of a} 


of @ comprehensive group G; then the relation 


Ck+m 


holds for all analytic polynomials f(t) = $+). 
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If the group G is transitive on Cr+ , then the constant @ is ¥ 0. 

The case m = 0 is the case of complex variables at its purest, whereas the case 
m = k — 1 is nearest to the case of Green’s formula for real variables. The 
latter case has been treated extensively in a previous paper,’ and will not be 
commented upon. In the case m = 0, the tensor reduces to a scalar K(¢; §) 
and the differential form (101) will be 


&) df dt. 


Its derivative is identically zero whenever K(¢; &) is independent of the con- 
jugate quantities §,,---,& thus being an analytic function K(t) = 
K(f, Thus we obtain formula 


cafte) = 2) ds dks 


provided C;, and K(f) dg, --- df, are invariant for some comprehensive group 
in the complex variables given. This formula obviously includes formulas 
(63) and (68). 


ADDENDUM 
Another Special Case 
We consider in the space of the real variables wu; , --- , uz , k 2 2, the semi-cone 
(P) > (ua + 


The square-root is positive and therefore u is automatically positive. The 
other variables are not algebraically restricted. It is easily seen that the relation 


mth + aw, > 0 
will hold for all win P if and only if 


(@) > + of)”. 

Furthermore, by a known formula”, 


_ Thus we obtain the formula 


r(§) 
F(w) = [ F(w) don +++ door 


for functions of integrable square in the tube with basis P. 


*S. Bocuner, Analytic and meromorphic continuation by means of Green’s formula, 
Annals of Math. 44 (1943), 652-673. 
”§. Bochner, Vorlesungen ueber Fouriersche Integrale, p. 189. 
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Following E. Cartan” we now introduce new variables by the relations 


= 


p = 2,---,k, 
and this leads to the formula 


k 
(3) | dts ++ 
Qi Sy — $1)? + + (ee — 


The characteristic manifold U is described by 
hl = lhl 
and the domain of validity in z is described by 
+a—al<la| 


Ala +al’. 
This domain can be proven bounded. 
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11 Domaines borne’s homogénes de l’espace de n variables complexes. Hamburgische 
Abhandlungen, 11 (1936), p. 149. 
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BOUNDARY VALUES OF ANALYTIC FUNCTIONS IN SEVERAL 
VARIABLES AND OF ALMOST PERIODIC FUNCTIONS 


By 8. Bocuner 
(Received June 22, 1944) 


INTRODUCTION 
If an analytic function f(z) in |z| < 1 is such that 


| an 
is bounded for 0 < r < 1 then there exists a boundary function f(e) such that 
(1 timean — se") | do = 0; 
or if we “eliminate” the boundary function f(e”) then relation (1) is equivalent 
with the relation 


— floe*) | = 0. 


This theorem of F. Riesz’ has the distinction of being true only for analytic 
functions and not for harmonic functions in general as many similar theorems 
are. Correspondingly, the proof is very “analytic.” It is based on the canoni- 
cal decomposition 
f@) = g()-Be) 

where g(z) # 0 and B(z) has boundary values 1 almost everywhere (Blaschke 
product). 

Now, for functions of several complex variables no comparable decomposition 
exists, and yet we will succeed in generalizing the theorem of Riesz proper. If 
fla, %) is analytic for 0 S |z;| << =1,---,k, andif 


is bounded then there exists an integrable boundary function f(e, --- , e”*) in 


such that 


1 Ueber die Raudwerte einer analytischen Funktion, Mathem. Zeitschrift, 18 (1922), — 
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tends to 0 as (m1, +++, T2) — (1, -+-, 1). The theorem can be stated entirely 
in terms of Fourier series. If we consider an additive set function of bounded 
variation on the Borel sets of the torus (2); if we introduce its Fourier series in 
the usual manner; and if the series has the special form 


eee Om y+++my em 1+ 
m,=0 


that is if the Fourier coefficient @,,,...m, vanishes whenever at least one index m; 
is negative; then the set function is of necessity absolutely continuous and 
therefore the indefinite integral of a Lebesgue point function. Actually we will 
show that for the conclusion to hold the non-vanishing Fourier coefficients need 
not be restricted rigidly to the “‘octant” 0 < m; < »& but may be contained in 
any convex cone which is strictly conical. 

The theorem of Riesz can be deduced from a sharper theorem of Hardy- 
Littlewood’ which states that the inequality 


2x 
|f(re®) |d8<y, O<r<il 
implies the sharper inequality 


where a is a universal constant. Now, if we replace the variable z by e * then 
the theorems of Riesz and Hardy-Littlewood are theorems on periodic analytic 
functions which are bounded in the half-plane x > 0. Our procedure will be 
to give at first a qualified extension of the theorem of Hardy-Littlewood to almost 
periodic functions of Bohr. This done, it will be quite easy to obtain the stated 
theorems in several variables by applying a fundamental relation which connects 
the two categories of functions. 

After that we will turn our attention to almost periodic functions in their own 
right. We will give both an unequalified extension of the theorem of Hardy- 
Littlewood and a full generalization of the theorem of Riesz. The “boundary 
function” whose existence is involved in the theorem of Riesz will be a Besi- 


m,=0 


covitch function. Nothing less general can be guaranteed. However the, 


analytic function given in x > 0 may be a Besicovitch function to start with, 
and as a matter of fact we will set it up in a new scope of generality. 
I. A Lemma on MEAN VALUES 
As a rule we will denote by M,f(y) the mean-value 


*A maximal theorem with function-theoretic applications. Acta Mathematica, 54 
(1930), 81-116. 
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provided it exists. We will also write M,(y) for the limit superior and M, f(y) 
for the limit inferior, whenever f(y) is real-valued. 

1.°. If K(y) is an even function in ©); if < <a 
K(y) is differentiable and monotonely decreasing; and if 


[ xw 


thus implying that yK(y) — 0 as y — @ ; then for any non-negative function g(y) 
which is Lebesgue integrable over any finite interval we have the relations 


and 
(6) KG) dy lim [ oly) « dy: 


Proor. Since K(y) was assumed even, our relations may be proven in the 
form 


(7) [ KG) dy < tim [ oly) KC) dy 


where ¢(y) is defined only in 0 S y < @ and where for the remainder of this proof 
T 
only, the symbols M and M are the lim superior and lim inferior of ; I g(y) dy 


as T 

The reader will next convince himself easily that we may now also assume 
that ¢(y) vanishes in a neighborhood of the origin. In particular, if we introduce 
the function 


== [" o at 


we may assume that 


< 7, 


Now 


« K(ey) dy = yK’' (ey) ®(y) dy 


where —K’(ey) 2 0. Corresponding to any p > 0 there exists an A, such that 
&(y) < Mywly) + p, A<y<*». 


* Compare S. Bochner, Fouriersche Integrale, p. 30. 
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Therefore 


and this proves (6). On the other hand, for each p > 0 there exists a B such that 
dy) = Mywly) — p, B<y<~», 
and (7) follows easily from 


II. A THrorem or Harpy-Lirrtewoop 


THEOREM 2. There exists an absolute constant a having the following properties. 
(i) If 


| a0 1, 0<r<l, 

then 
(8) supocr<: | f(re”) | < a-y. 

(ii) If 
(9) [tet 
then 

(iii) If 
(11) [116 + iy) | dy 
is bounded inO T < <a < ~, for every fixed > 0, and if 
(12) M, | f(x + iy) | <7, 
then for all x, > 2% > O we have 
(13) M{max,,<e<e, | f(x + iy)} S 


(iv) The best constants a in each of the relations (8), (10), (13) have the same 
values. 
Proor. Ad (i). This is a renowned theorem of Hardy-Littlewood.’ The 
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proof will not be repeated. The other parts of the theorem will be derived 
from (i). 

Ad (ii). Ifr = r(@) isa continuous function in 6 6, where < 4 < 
6; < a, and if 0 < r(6) < 1, then (8) implies 


(14) | | dd < ar. 
2m Jo, 

Now, let f(z + iy) be analytic in x > 0, and let 
(15) = Sy 8} 
be a fixed rectilinear polygonal Jordan arc situated in a fixed rectangle 
(16) 
where 


If a > 0 is sufficiently large, then the circle 


inet 


will have the following properties. The curve (15) is contained in its interior, 
and any radius of the circle which meets the curve at some point meets it at 
one point only. Therefore, relation (14) implies the relation 


(a — z(y))? 


(a—p/2) 
Sa [ 
™(a—p/2) 
We now let a—> ». The expression on the left converges to 
+ | ay. 
In the integral on the right the path of integration converges to the straight 
line x = 4 —2° < y < o, and the integral will itself converge towards its 


formal limit which is 
+ 
provided the function f(z) in the half-plane z > 0 satisfies an estimate 


(17) 


dh. 


— 


— 
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ay. Thus for a function of this description the last inequality implies ‘ 
9) seo) + | ay s 


Next, if f(z) is an arbitrary function for which (9) holds then a sequence of ; 
functions fo(2) can be found which approximate to f(z) in the norm (9) and such : 
that each fo(z) satisfies an estimate (17) for some M. The sequence converges 
uniformly in (16), and thus (18) holds for f(z) in general. We now hold the 
function f(z) and the rectangle (16) fast, and vary the curve. By convergent 
sequences of curves we can reach any semi-continuous functions y(x), and 
therefore (18) implies 


[ MAXp<e<¢ | f(x + ty) | dy S a-y. 


letting p > 0, +0, r— we finally obtain (10). Fi | 
Ad (iit). For0 < € < € we introduce the function | 
| 
Since (19) is bounded for x > £(> 0) we see that } 


is bounded for x > &, and by (ii) we obtain for 2; > 2% > & the inequality q i 


| + ty) |dy Sa | + ty) | dy. 


= 


Therefore, | 


MAXzsese, | f(x + ty) | + iv) | + iy) | | 

We now multiply both sides by « and apply Theorem 1. If we put K(y) = ; 
(1 + in (4) we obtain 


lim sups+o ef. lay < M,\ fe + 


and if we put K(y) = [(1 + éxito)” + y'J” in (5) we obtain 


M, maxzy<ese, | f(x + ty) | 1 rape 


| f(x + ty) | dy 


If we put together the last three relations and let « —> 0 we finally obtain (13). 3 . 


t at 
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Ad (iv). If a, a2, a3 are the best constants in (8), (10), (13) respectively, 
then our argument shows a1 = a2 = as. However ai S az and thus 
a, = a = a,, as asserted. 

TurorEmM 3. If f(z) is an analytic almost periodic function‘ in x > 0 {that is 
in each half planet S x < ~, — > OI, and if 


(20) M, | f(z + ty)| = 7, 0<2< 
then for each x > 0 we have 
(21) M, | f(x + ty) |} S ary. 
Proor. In fact, relation (13) is in the present case 
(22) | f(z + ty) |} ay 


for 2; > 2% > 0. Also, (20) implies that the exponents in the Dirichlet series of 
f(z) are non-negative, and therefore for x; — © the integrand in (22) converges 
to the integrand in (21), uniformly in —~ < y < «. This proves (21). 

If f(z) is pure-periodic we may pass from (21) to 


(23) M,{supocece | f(t + ty) |} S ay 


by ordinary Lebesgue theory. However, for almost periodic functions such an 
unqualified passage to the limit is not admissible without a detailed investiga- 
tion, and the very existence of the mean-value (23) is a matter requiring in- 
vestigation. We will not investigate this problem in the direct manner just 
outlined; but we will see in Part IV that there is a possibility of generalizing 
relation (23) to the almost periodic case. 


III. Functions inv SEVERAL VARIABLES 


We will describe a well known connection between periodic functions in several 
variables and almost periodic functions in one variable. For fixed k, we consider 
the class of all continuous periodic functions 


on the torus 

(25) 0 St; < 2z, j=1,:::,h, 
and we take a fixed set of positive real numbers 

(26) Mi, 5 Me 


which we assume to be linearly independent with regard to rational coefficients. 
Thus there is no relation of the form 


gu + = 0 


‘ As defined by H. Bohr, and not in any more general sense, for the present. 
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= g, = 0. We now make the substitution 
= at, eee 

and the resulting function 

(27) g(t) = flit, +++ , mit) 


is a Bohr almost periodic function with the expansion 
+ t 


for integers {g;} other than g; = -- 


We will also denote the function g(t) by 


L(f) = L(f; t). 
The transformation 
f- lf 


is functional operation having the following properties. It is distributive, and 
it also transforms the real and imaginary parts of f separately into the real and 
imaginary parts of g. It is multiplicative, 


L(fi-fe) = Lfi- , 
and if f, — f uniformly, then Lf, — Lf uniformly. Therefore if 
(28) 
are real functions, and if 
A(ur, Un) 


is a continuous function in the real variables w , --- , Un Which is defined and 
continuous on the closure of the trajectory (28), then 


Furthermore, the range of values of g(t) is dense in the range of values of f(t), 
and in particular, 


sup; | g(t) | = max,| f(i, --- ,t&)|- 
This will be of no consequence; but what will be of consequence is the relation 


or written differently, 
M.L(f; t) = Mifla,-::, &).- 
Now, for p = 1, and indeed for p > 0, the function 


Maur. we) = (ut + uz)?” 


thus 
| 
hat is 
0, | | 
| 
| 
1 an | 
iga- | 
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is continuous. Therefore if we put f = f, + tf2 and apply (29) we see that 


L(|f|") =| 

and if we substitute this in (31) we obtain 
(31) M,|Lf|? = M.|f|’. 
Furthermore, the function 

Un) = max Un) 
is continuous in uw, = 0, --- , u, = 0, therefore if functions 

fay 

are given, and if 


g = Lf, 
then 


max = L(max 
and thus we obtain also 
(32) M, max = max fa |}. 


We are now ready for the main theorem stated in the Introduction. 
THEorEM 4. If a power series 


f(s, ooo, Sx) = coe >: Ging 
m,=0 
in the complex variables 
= + j=1,--- 


is absolutely convergent in the domain 
(33) m1 
and if for all (01, --- , ox) in (33) we have 


1 
(34) af | flor + tt, +++, + tty) | dt, dhe < 
then the limiting series 


(35) 


is the Fourier series of a function of class L’, 
More generally, if a Laurent series 
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is absolutely convergent in a k-dimensional domain P of the (01, --- , ox)-space, 
if P is the union of half-lines issuing from but not containing the origin, and if 
relation (34) holds in P, then the limiting series is the “ourier series of a function 
in L’. 

Proor. Since P is a domain, and thus contains a neighborhood, it contains 
a point whose coordinates are linearly independent. Denote them by 

By assumption, P contains the half-line 
oj = j=1,---,k; 0<u< 
Therefore we can introduce the function 
g(w) = f(uwu, WwW =ut+w, 


of the one complex variable w. This function is almost periodic, and, on the 
basis of formula (31), our relation (34) implies the kindred relation 


(36) M, | g(u + iv) | Sy, 0<u < 
since in terms of our previous operation we have 
gu + = LUf(uu + + tt). 
Now, take a sequence of numbers 
> U2 > us > --- 0, 
and consider the function 
= g(un + it). 

The latter is the transform of 

Fat) = f(urtn + th, +++, wun + tte). 
By theorem 3, relation (36) implies 

M,max Sev, 
and by relation (32) this is equivalent with 


(37) M, max 
n= 1, 2,---. By ordinary Lebesgue theory on the torus (25) this implies 
that the sequence of functions , , is majorized by an integrable func- 


tion on the torus (25). On the.other hand, their Fourier series are convergent 
term-by-term to the limiting series (35). By a familiar lemma, the limiting 
series is the Fourier series of a function in L’, and this completes the proof of 
theorem 4. 

We note that in the first part of theorem 4, the assumption that our Laurent 
series shall be strictly a power series was redundant. It follows from the assump- 
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tion that relation (34) shall hold in the entire ‘“‘octant” (33). Similar restric- 
tions on the power series are implied in relation (34) depending on the type of 
domain P we choose in lieu of the octant (33). In fact, if P contains a half-line 
(oyu, +++, oxu),0 <u < then relation (34) implies that a coefficient an, 
can be different from zero only if 


(38) + +++ + mox 2 O. 


This restriction is automatic. However if we are willing to introduce this re- 
striction explicitly, then we may obtain another type of theorem. 

TuroreM 5. If (35) is the Fourier series of a set function of bounded variation 
on the Borel sets of the torus (25), and if the Fourier coefficients are ¥ 0 only for such 
indices m,, +++ , m as satisfy relation (38) for all (01, +++ , ox) of some k-dimen- 
sional neighborhood N, no matter how small, then the set-function is absolutely con- 
tinuous, and thus (35) is the Fourier series of a function un L’. 

Proor. We denote by P the point set consisting of all half-lines issuing from 
the origin and each containing at least one point of N. Obviously, (38) is satis- 
fied for all points of P. We also introduce a sequence of exponential polynomials 


which converge term by term towards (35) such that 
&) | <7, 


where y is the variation of our set function. From the restriction on the Fourier 
coefficients it follows that we also have 


(39) M,| ane | 


for (01, °*:, ox) in P. We can now let »v > ~ in (39), and then apply theo- 
rem 4. 


It should be noted that theorem 5 is quite sharp; it is not permissible to let 
the neighborhood N shrink to a point. In fact, for k = 2, if 


is the Fourier series of a set function in one variable which is not absolutely 
continuous, then it is the Fourier series of a set function in two variables which 
is not absolutely continuous, although 


+ = 0 


for o, = 0, o2 = 1 and all integers m, . 

REMARK ON L”, p > 1. The latter comment emphasizes the distinction be- 
tween our theorem concerning L’, and a theorem of M. Riesz concerning L’, 
p > 1. The theorem of M. Riesz states that if 
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is the Fourier series of a function in L’, then so is its “portion” 
Qin gm, 
m=0 


As we have demonstrated somewhere else,’ its generalization is that whenever 
(35) is a series in L”, then so is its “portion” 


» 


this portion arising by dissection of the original series along any single one hyper- 
plane. The fact is that the theorems of M. Riesz on conjugates is a theorem on 
functions in real variables (some of its proofs notwithstanding), whereas the 
theorem we are discussing in the present paper is not so. 


IV. Atmost Periopic Funcrions 


We will consider in — «© < y < ~ almost periodic functions f(y) of the most 
general kind, such being the functions of the Besicovitch class B’. They result 
from closing the space of Bohr functions by the norm M, | f(y) |. We will say 
that the functions f(y), g(y) are equivalent if 


M, | f(y) — gly) | = 9. 


In many important respects equivalent functions cannot be distinguished from 
one another. If f(y) and g(y) are both real, then we will say that g(y) exceeds 
f(y) if the difference g(y) — f(y) is equivalent, in the sense just described, with 
anon-negative function. Investigations of almost periodic functions on general 
groups’ have revealed many properties of Besicovitch functions analogous to 
those of periodic functions. In particular we will require the following two 
properties which are implied in general results. 

Property 1. If a sequence of real functions in B’ are monotonely increasing, 
S Gnaily), and wf their norms are bounded from above, 


Myg.ly) n=1,2,-:-, 
then there exists a function g(y) in B’ which exceeds all g,(y) and for which 
Mygly) = v. 


Property 2. If {f,(y)} is a sequence of complex functions in B’, if all | f.(y) | are 
exceeded by some g(y) in B’, and if, for n — ~, the Fourier series of the sequence 
fly) are convergent term-by-term towards a limiting trigonometric series, then the 
latter series is again the Fourier series of some function in B’. 

There is no clear-cut definition of functions B’ of the complex variable z = 


°S. Bochner, Additive set functions on groups, Annals of Math. 40 (1939), 769-799. 
* In addition to the author’s paper quoted in 5) see E. van Kampen, Almost periodic 
functions and compact groups, Annals of Math. 37 (1936), 78-91. 
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i tion that relation (34) shall hold in the entire “‘octant’’ (33). Similar restric- is 
he tions on the power series are implied in relation (34) depending on the type of 


( domain P we choose in lieu of the octant (33). In fact, if P contains a half-line 
(oyu, 0,u),0 <u < ©, then relation (34) implies that a coefficient ap, », 


' can be different from zero only if A 

(38) + +++ + 2 0. 
Vi This restriction is automatic. However if we are willing to introduce this re- 

| striction explicitly, then we may obtain another type of theorem. th 


TuroreMm 5. If (35) is the Fourier series of a set function of bounded variation 
! on the Borel sets of the torus (25), and tf the Fourier coefficients are ¥ 0 only for such fu 
indices m,, «++ , Mx as satisfy relation (38) for all (01, +--+ , ox) of some k-dimen- th 
sional neighborhood N, no matter how small, then the set-function is absolutely con- 
tinuous, and thus (35) is the Fourier series of a function in L’. 

Proor. We denote by P the point set consisting of all half-lines issuing from 
the origin and each containing at least one point of N. Obviously, (38) is satis- 


| i fied for all points of P. We also introduce a sequence of exponential polynomials ; 
ce which converge term by term towards (35) such that 

M.\f%(, 
where y is the variation of our set function. From the restriction on the Fourier or 
coefficients it follows that we also have ' i( 
a 
for (01, --:, ox) in P. We can now let vy — © in (39), and then apply theo- tt 
rem 4. P 

It should be noted that theorem 5 is quite sharp; it is not permissible to let 
the neighborhood N shrink to a point. In fact, for k = 2, if 4 
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is the Fourier series of a set function in one variable which is not absolutely 

continuous, then it is the Fourier series of a set function in two variables which 

is not absolutely continuous, although 

02 0 = 0 
for o; = 0, o2 = 1 and all integers m, . : 

Remark ON L’, p > 1. The latter comment emphasizes the distinction be- 

tween our theorem concerning L', and a theorem of M. Riesz concerning L’, 
p> 1. The theorem of M. Riesz states that if 
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als from closing the space of Bohr functions by the norm M, | f(y) |. We will say 
that the functions f(y), g(y) are equivalent if 
M, | f(y) — g(y) | = 9. 
In many important respects equivalent functions cannot be distinguished from a 
cor one another. If f(y) and g(y) are both real, then we will say that g(y) exceeds | 
f(y) if the difference g(y) — f(y) is equivalent, in the sense just described, with } | 
a non-negative function. Investigations of almost periodic functions on general a 
groups’ have revealed many properties of Besicovitch functions analogous to 
those of periodic functions. In particular we will require the following two | 
properties which are implied in general results. | 
m Property 1. If a sequence of real functions in B’ are monotonely increasing, | 
gn(¥) S Gnsily), and rf their norms are bounded from above, | 
Mygn(y) n=1,2,-°-, | 
ty then there exists a function g(y) in B’ which exceeds all g,(y) and for which 


Mygly) < v. 


Property 2. If {f,(y)} is a sequence of complex functions in B’, if all | f,(y) | are 
exceeded by some g(y) in B’, and if, for n — ~, the Fourier series of the sequence 
f(y) are convergent term-by-term towards a limiting trigonometric series, then the 
latter series is again the Fourier series of some function in B’. 

p There is no clear-cut definition of functions B’ of the complex variable z = 


°S. Bochner, Additive set functions on groups, Annals of Math. 40 (1939), 769-799. | 
*In addition to the author’s paper quoted in 5) see E. van Kampen, Almost periodic Fi \ 
functions and compact groups, Annals of Math. 37 (1936), 78-91. 
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x+ iy. All that can be required without prejudice is that the function shall have 
a Dirichlet expansion of the form 


(40) 2, 


in the sense that for all x of some interval it shall be a B’-function in the y-vari- 
able having the Fourier series 


(41) > 


Now, for our purpose it will be best to require just that and nothing more. Thus 
we start from a formal series of the type (40), and we assume that for each z in 
0 < x < o, the series (41) is the Fourier series of some element in B’. De- 
noting this element by f(x; y) we obtain a family of functions, where the variable 
x is a parameter designating membership in the family. We will call f(x; y) a 
formal function. 

TuEorEeM 6. If the formal function f(x; y) inO0 < x < ©@ satisfies the in- 
equality 


(42) 
then there exists a B'-function g(y) which exceeds | f(x; y) | for every x, such that 
(43) Mygly) S a-y, 


where a is the universal constant of theorem 2. 
Proor. We associate with (40) a sequence of approximating exponential 
polynomials 


(44) Mey 

of the Fejer-type.’ They have the properties 

(45) |f (Ey) — fy) |=0, 
(46) My 


On account of (42) we may assume that all \, are non-negative, and therefore, 
by a known theorem® we have 


(47) My y) | = | 
and 
(48) M, = y) — | 
for 


Letting | — , this implies 
(49) M, \ f° = y) — Fy) | 


7 A. 8. Besicovitch, Almost periodic functions, 1932, p. 103. 
* Hardy, Ingham and Polya, Theorems concerning mean values of analytic functions, 
Proceedings of Royal Soc. Lond. 113 (1927), 542-569. 
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for 


O<t<a. 


Since the function (44) is uniformly continuous in any strip we hence conclude 
the continuity property 
(50) lims..M, | f(é; y) — f(x; y) | = 0 


> 0. 
We now apply theorem 3 in the half-plane x > &, instead of x > 0. In par- 


ticular, if 21, --- » % are real numbers, all > £, then 
(51) My maxigese | y) | 


From the expression 


and from similar expressions for max (uw, --- , u,) the reader will conclude that 
we may let k — © in (51) and hence obtain 


(52) M,9.(y) S a-y, p= 1, 2, 
where 
(53) = Maxiges, | f(te; y) | 


is a B’-function each. We now take a sequence of numbers {z,} which is a 
dense set in 0 < 2 < o, and we form the sequence of functions (53). They are 
increasing, and, by (52), their norms are bounded. By Property 1 of B’-func- 
tions, there exists a limit-function g(y) which exceeds all g,(y) and hence all 
| f(z; y) |. Thus all functions 


— | f(x»; y) | 


exceed the function zero. By (50) it is now possible to conclude that g(y) — 
| f(é; y) | exceeds zero for all £ > 0, and this completes the proof of our theorem. 

TuEeorEM 7. If f(x;y) is a formal function with expansion (40) and if (42) holds, 
then there exist a B’-function f(0; y) with expansion 


(54) 
such that 
(55) lim, 0M, | {(0; y) — f(a; y) | = 0. 


Proor. Since the functions | f(x; y) | are exceeded by a function g(y), the 
existence of a function with expansion (54) follows from Property 2 of B’-func- 
tions. Relation (55) will then follow, if we approximate to f(0; y) by our ex- 
ponential polynomials 


= ac™ 
and then apply (48) and (49). 
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ReMARK ON Functions B’, p > 1. The following theorem is a secondary 
result since it applies to harmonic functions in general. 

TureoreM 8. For p > 1, if h(x; y) is a function of class BP in <y < 
for0 <a < ~, af its Fourier series has the form 


e COS Any + bn Sin Any) 
and if 
M,|h(z;y)|7S7, 
then there exists a B?-function h(0; y) whose Fourier series is 


(56) >> (dn COS Any + Dn Sin Any) 
and 
(57) limz 0M, | h(0; y) — h(x; y) |” = 0. 


Proor. By a property of B’-functions, the existence of the function h(0; y) 
with series (56) follows directly from the facts that the family of functions 
{h(a; y)} is bounded in norm and that their Fourier series are convergent term- 
by-term asx— 0. Relation (57) follows again by applying algorithmic summa- 
tion. 

Finally we will demonstrate that there is no analogue to theorem 5 for almost 
periodic functions of unrestricted sequences of exponents {),}. 

A counter-example. We take a sequence of positive numbers we, us, 
with the properties that no two general partial sums 


Hn, + + bn, 


forl Sr < < m < < n,, have the same numerical value. As for 
the existence of such sequences we note that any sequence of numbers which 
decrease to zero contains a subsequence of the given description. If we now 
form the functions 


fle) = +6"), 


m=1 


then 
Myfn(z + ty) = 1, 


for all n and all x; also the expansions of f,(z) converge term-by-term, as n — ~, 
towards a series containing as sub-series the expansion 


However, for um — 0, the terms {e“™"} do not converge to zero for any 2 as 
m — , and thus the entire series cannot be the Fourier series of-a B’-function 
f(x; y) for any real value x whatsoever. 
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TRANSFORMATION THEORY OF NON-LINEAR DIFFERENTIAL 
EQUATIONS OF THE SECOND ORDER 


By Norman LEVINSON 
(Received January 28, 1944) 


1, Introduction 


Interest in the field of non-linear differential equations in bygone years centered 
around the dynamics of conservative systems, the problems arising mainly in 
celestial mechanics. Current interest is focused mainly on non-conservative 
systems. Nevertheless many of the advances made in the past are of the greatest 
relevance to the problems of current interest. Therefore we shall give an 
account here of certain of these past results relevant to current problems. We 
shall also adapt certain methods so as to get results for these current problems. 

So far as the non-linear equations that we shall consider are concerned, the 
most challenging problem mathematically appears to be that of finding means 
of excluding the possibility of certain singular situations. We shall describe 
some of these singular possibilities. As yet there is no method available to 
indicate under what conditions they cannot arise. 

Transformation theory as a method in differential equations is due to Poincaré. 
The type of transformation we shall be interested in here is that of the Euclidean 
plane into itself. Certain methods and results of Birkhoff will be shown to be 
of interest. 

In connection with their work on second order nion-linear equations, the 
non-linear terms-of which have a small parameter as a coefficient, Kryloff and 
Bogoliuboff’ make use of transformation theory, particularly the theory that has 
been developed in the study of curves on a torus by Poincaré and Denjoy. 

The second order equation 


| (1.0) + f(x, + g(x) = 


with e(t) of period L, is of considerable interest in applied mathematics. By 
setting « = y this equation becomes 


(1.1) = —f(z, yy — g(x) + 
The system (1.1) is a particular case of 
(1.2) ¢=F(z,y,t), y=G@,y, 0d), 


where F(x, y, t) and G(z, y, ¢) are both periodic in ¢ with period L. We shall 
assume that F and G are analytic in x, y, and ¢ although much less would suffice 
for most of the discussion. 


Krytorr N. N. Bocotrusorr, Les Methodes de la Mécanique Non Linéaire 
Appliquées 4 la Théorie des Oscillations Stationnaires, Monograph No. 8, mee of Sci- 
ences of the Ukraines, 1934. 
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A system of the form (1.2) will be said to be of class D, or to be a dissipa- 
iive system for large displacements, if there exists an R such that any solution 
(x(t), y(t)) of this system remains in the finite (x, y) plane as ¢ increases and 


(1.3) lim sup 2°(é) + < R’. 


That is, any solution of a dissipative system of the form (1.2) eventually lies 
inside of a circle in the (zx, y) plane with center at (0, 0) and with radius depending 
only on the system. As might be expected from energy considerations, most 
systems which arise in practice are in class D. 

This is a consequence of the following result: Under rather general conditions’ 
there exist simple closed curves in the (x, y) plane such that a solution (x(t), y(t) 
of (1.1) can intersect any one of these curves only by crossing it from the domain 
exterior to the curve into the domain interior to the curve. Moreover through any 
point in the (x, y) plane sufficiently remote from the origin, there passes a curve 
with this property. 

Clearly if Cy denote a simple closed curve of the type just described and if C, 
is sufficiently remote from the origin so that through every point exterior to it 
passes a curve of the type described above, then every solution of (1.1) starting 
in the domain exterior to Cy must eventually cross into the interior of Cy and stay 
there. Thus, under conditions generally met in practice, (1.1) is of class D. 

It is well known that to a system such as (1.2) there corresponds a trans- 
formation of the (x, y) plane into itself. To see this let us consider the solution 
(x(xo, yo, t), y(%o, Yo, t)) of (1.2) which when ¢t = & is at the point (2, yo) of 
(x, y) plane. Let 


Zn = bo, & + nL), Yn = Yo, & + nb), 
for any integer, n. Since F and G are of period L in ¢, it follows that 
(1.4) Intm = + nL), Ynim = Ym, to + nL). 


If P,, denote the point (x, , yn), then we define a transformation T' of the (x, y) 
plane into itself by TP) = Pi. By 7” is meant the transformation that takes 
Pyinto P,. Clearly (1.4) is equivalent to T"*"P) = = More- 
over since solutions of (1.2) are continuous with respect to changes in initial 
values, T' is continuous. If we now consider the solutions of (1.2) as curves in 
(x, y, t) space, it becomes clear that to study the behaviour of the solutions of 
(1.2) we have only to study the transformation 7 of the (x, y) plane into itself. 
In particular a fixed point under the transformation 7” corresponds to a periodic 
solution of (1.2) of period nL. 


* See N. Levinson, On the Existence of Periodic Solutions for Second Order Differential 


Equations with a Forcing Term. Journal of Math. and Physics, Vol. XXII (1943), p. 41, 
Theorem II. 
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2. Maximum finite invariant domain 

If we consider the closed curve Cy, described above, it follows that under 
iterations of 7’ all points exterior to Cy are transformed into interior points. 
Moreover if 7’"Co be denoted by C, , it follows that C, lies in the domain interior 
to Cy). Thus C2 lies in the interior of C,, etc. Consider the closed domain 
interior to all C, ,n 2 0. Clearly this domain is finite and is invariant under T’. 
Moreover it is obviously the maximum finite invariant domain. We shall now 
prove the existence of such a maximum finite invariant domain for every system, 
(1.2) of class D. 

TueorEM I. Every system (1.2) of class D, possesses a closed domain, I, in- 
variant under T'. The complement of I is an open simply connected domain if the 
point at infinity is adjoined to the (x, y) plane. Under iterations of T every point 
in the complement of I except the point at infinity tends to I. I is the maximum 
finite invariant domain under T. 

THeorEM II. Every system (1.2) of class D has at least one fixed point under 
the transformation T. Thus (1.2) has at least one solution of period L. 

Proor oF THEOREM I. By the property, (1.3), of systems of class D and by 
the continuity of solutions of (1.2) with respect to changes in initial values, 
there exists an N > 0 such that if Ry represents the circle, 2° + y’ < R’, then the 
open simply connected domain 7’ Ry = Ry is interior to Ry and remains interior 
to Ry in further applications of J. From Ry C Roit follows at once that T°’ Ry = 
Rey is interior to Ry , etc. Let the closed domain interior to all Riy , where 
k = 0, 1, 2,--- , be denoted by J. 

Let Qo be an open simply connected domain in the interior of which is Ry and 
R,. Then by the property of systems of class D there must exist an m such that 
C Ry. Thus we have C Ro Applying we have 


Since Rv, > I as k > ~, it follows that as k > 
(2.1) Qu I 


From R, C Qo, it follows that Ro C Q_1. Proceeding with Q_: as we have 
above with Qp we find that analogous to (2.0) we have 


Thus Qyz-1 + I as k > ©, or applying T we find Qu. — TI. Combining this 
with (2.1) we have TJ = J. Thus J is invariant under T. The other state- 
ments of Theorem I follow immediately and thus we have demonstrated the 
existence of the maximum finite invariant domain under 7’. 

Proor or TuroreM II. As in the proof of Theorem I let the open domain, 
y< R’, be denoted by Ry. Clearly J C Ro and Ro, k 20. Con- 
sider the open set Ro + Ri + --- + Ry-1. A point P. is said to be occluded’ 


°G.D. Brrxnorr. Surface Transformations and Their Dynamical Applications. Acta 
Mathematica, Vol. 43, 1922, see p. 80. 
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by this open set if a simple closed curve can be drawn entirely within this set 
and enclosing P and J. Let Ko be the set of points occluded by this open set. 
Clearly Ko is simply connected. Let TKo = Ki. Then K; is the set occluded 
by Ri + Re +---+ Ry. Since Ryo D Ry, it follows that Ki C Ko. Since R, 
is a circle and T analytic, it follows that the boundary of Ko is everywhere ac- 
cessible. (In fact it is made up of analytic curve segments.) Thus by the 
Brouwer fixed point theorem, there exists a point in Ko fixed under T. Clearly 
this fixed point must be contained in /. 


3. Invariant closed curves and rotation numbers 


As with Birkhoff* we shall mean by a closed curve the boundary of a simply 
connected open continuum in the finite plane. The plane is regarded as com- 
pleted by the adjunction of the point at infinity. A closed curve invariant 
under T' is called an invariant closed curve. Clearly the boundary of the com- 
plement of J, the maximum finite invariant domain, is an invariant closed curve. 

In addition there may be contained in J any number of invariant closed 
curves. In the simplest case where the right members of (1.2) actually do not 
involve ¢ then any value of L is a period. In this case the invariant closed 
curves would be the limit cycles of Poincaré. 

Let us denote an invariant closed curve by C. First let us consider the case 
where C is a simple closed curve of length \. Consider all the solutions S- 
of (1.2) starting on C when ¢ = 4. Fort S ¢t S t + L, these solutions, S-, 
form a surface in (x, y, ) space which is bounded by C when t = t) andt = & + L. 
Thus this surface can be mapped on the closed torus. The problem of the solu- 
tions, S-, of (1.2) which emanate from C is reduced to the problem of the 
solution of a differential equation on a torus. 

Clearly the equation of the surface formed by the solutions of (1.2) emanating 
from C can be written as x = f(0, t) and y = g(6, t) where f and g are of period 
1 in 6 and of period Lint. In fact f and g can be so chosen that, for any fixed 
t = t,, @ is proportional to arc length on the curve, x = f(@, 4), y = g(0, ti). 
Differentiating x and y we find 

de dy _0gd0 , ag 
Using (1.2) and multiplying the first of the above equations by af/a@ and the 
second by dg/40 and adding we get 


dé 1 


00 00 
Let \; denote are length on the curve x = f(6, ti), y = g(@, 4). Then since @ 
is proportional to are length on each such closed curve we have 


0<Min ai s (4) + (2) = < 
= \00 00 


4 Loc. cit. p. 79. 
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Thus the right member of (3.0) is bounded and is periodic of period 1 in @ and 
Lint. Its solutions as already stated are best represented geometrically on a 
torus. 

Associated with such a differential equation, or with a family of curves on the 
torus, or with a transformation of a simple closed curve into itself is a rotation 
number, p. This number is simply the average advance of 0, for an advance 
of t = L. 

If p is rational and of the form p/g where p and qg have no common factors, 
then (3.0) has solutions of period gl in ¢. Any non-periodic solution must tend 
toward such a periodic solution as t > ©. In this situation (1.2) has q™ sub- 
harmonics among its solutions. 

If p is irrational there are two possibilities. One of these possibilities is 
termed the singular case. In case the right member of (3.0) is reasonably well 
behaved, for instance twice differentiable with respect to 6, the singular case is 
ruled out.© Under these latter circumstances the solutions of (3.0) are of the 
form 


tp t tp 
w= 


where’ H(u, v) is periodic in u and v of period 1 and ¢ is an arbitrary constant. 
Clearly x and y are almost-periodic functions of ¢ in this case. In fact they are 


of the form h ¢ | 2) where h(u, v) is of period lin uandv. In the case of cer- 


tain systems which depart only slightly from linear Kryloff and Bogoliuboff® 
show that this is the case. 

In the singular case there are solutions which trace out curves that together 
with their limit points intersect any meridian of the torus in a perfect, no-where- 
dense set of points. 

There are cases of (1.2) of considerable practical interest where in effect, aside 
from a fixed point, there is only one invariant closed curve, C. If C were a 
reasonably well behaved curve the results just given would afford a complete 
qualitative solution of such cases. Even where there are several invariant 
closed curves the qualitative situation would be well in hand if these curves 
were known to be well behaved. Actually there is no indication that C need 
be a reasonable curve. 

In fact Birkhoff® has given an example of an analytic transformation of the 


’Porncar&. Collected Works. Vol. I, p. 137, Chapter XV. 

6A. Densoy. Sur les courbes definies par les equations differentielles a la surface du tore. 
Journal d. Math. Vol. 11 (1932), p. 333. 

™P. Bonu. Uber die Hinsichilich der Unabhangigen und Abhangigen Variabeln Periodische 
Differential Gleichungen. Erster Ordnung. Acta Math. Vol. 40 (1916), p. 321. 

Loc. cit. 

°G. Birxuorr. Sur quelques Courbes Fermées Remarquables. Bull. de la Soc. Math. de 
France. Vol. 60, 1932, p. 1. 
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plane into itself which leaves a closed “curve,” J, invariant. J divides the 
plane into two invariant open simply connected continua, S; and S, where to 
S, is adjoined the point at infinity. Every point of J is a limit point of §; 
or of S, or of both. Those points of J accessible from S, we denote by J., 
and from S; by J;. Associated with the transformation which carries J, and 
J; into themselves are rotation numbers p, and p;. What Birkhoff showed is 
that p. need not equal p;. This indicates how complicated J can be even 
though 7’ is analytic. Unfortunately there is at present no basis for excluding 
curves of this type from the invariant closed curves of (1.2). 


4, Fixed points 


The solutions of (1.2) can be regarded geometrically as a family of curves in 
(x, y, t) space which, due to the periodicity of F(x, y, t) and G(a, y, ¢) in t need 
only be studied between the two planes ¢ = & and? = & + L. Any point, 
(xo, Yo), in the (x, y) plane when ¢ = % is carried into a point (2, y:) in the 
(x, y) plane when ¢t = t& + L by the solution of (1.2) emanating from (2 , yo). 
This as we have seen defines the transformation 7’. 

Let us consider the change in area in the (x, y) plane under the transformation 
T. Let (x(20, yo, t)), (y(ao, yo, t)) denote the solution of (1.2) at (xo, yo) when 
t = &. Let the Jacobian of (x(xo, yo, t), y(%o, yo, with respect to (a , yo) 
be A(t). That is 


(4.0) A(t) = | 9% 


Differentiating (4.0) and using (1.2) we find 
dd (aF , aG 
Integrating and using A(&) = 1, we find that A(& + L), the Jacobian of the 
transformation T for the point (2, yo) is given by 
O21 
(4.1) (2:31) dys = exp| By dt}. 
OX 


Thus the element of area dx dy is carried by T into J((a1 , y1)/2%0 , Yo)) dyo 
Now let us study the transformation 7 in the neighborhood of a fixed point 
of the transformation. This will be facilitated by making a transformation of 


coordinates which takes the fixed point into the origin. That is let (z(), g()) 


be a solution of (1.2) of period L. Then the point (Z(t), G(to)) is a fixed point 
under 7. Let Po be the point (%(4) + wo , J(to) + vo) in the (zx, y) plane. Denote 
TP, by P; with coordinates + G(to.) + It is well known that the 


all 
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solution (x(¢), y(¢)) starting at Po when t = t) can be represented by a power 
series in Up and v with coefficients functions of t. Thus 


a(t) = E(t) + + + + ca(t)uovo + 
and similarly for y(t). In particular setting t = & + L we get 
U1 = + by + --- 
1 = Cuo + dy + --- 


where the terms not explicitly given in the right members are of degree two or 


higher in and . 
If we denote (Z(t) + ww, + v0) by (ao, yo) and + um, (to) + 


Using (4.1) and (4.2) we have 
ab| _ (aF(@, 9, t) | 
cd = ( )a 


Thus ad — be > 0. 
The transformation (4.2) has been very much studied. For small values of 


uw and v, its character is determined by its linear terms. That is, the trans- 
formation can be characterized by the roots of the equation 


(4.4) (a — p)(d — p) — be = 0. 


If one or both roots of (4.4) is 1, this means that (Z(t), J(t)) is a multiple 
fixed point of 7. A slight change in the parameters of the equation (1.2) will 
ordinarily separate these points and thus the situation requires no special treat- 
ment but can be studied as a limiting case of simple fixed points. We shall 
therefore assume the roots of (4.4) to be different from 1 in the discussion that 
follows. We shall also assume the roots different from —1, which indicates 
multiple fixed points: under 7”. 

Let us denote the roots of (4.4) by pi and p.. Since ad — be > 0, it follows 
from (4.4) that ppp > 0. We now characterize the periodic solution, (Z(é), 
g(t), or what is the same, the fixed point, (Z(t), (t)), as completely stable if 
|| <1, |e] <1. In this case the point (wm, 1) is nearer to (0, 0) than 
(uo, 0). Iterations of 7’ bring (uo, vo) nearer and nearer to (0, 0). That is, 


(4.2) 


(4.3) 


all solutions of (1.2) near (Z(t), g(t)) move closer and closer to ((), g(é)) at. 


t— 


Similarly if | p:| > 1, | | > 1, then we shall term the solution (Z(¢), 9(¢)) 


completely unstable. 
If p,: > 1 > p2 > O, then we shall call the solution (2(t), 9(¢)), directly unstable. 
If pp < —1 < p, < 0, then the solution (Z(¢), 9(¢)), will be called inversely 


unstable. 
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The only other possibility is the case | p:| = | p2:| = 1. In this case p, = 4, . 
This is a very important case in conservative systems. In this case stability 
can be investigated only by considering higher powers of wo and v in (4.2). 
Since in this case stability is not determined by the linear part of the trans- 
formation we shall call it the wndetermined case. 

In the case of directly or inversely unstable points there is a curve invariant 
under T passing through the fixed point’®. Points on this invariant curve move 
farther and farther away from the fixed point under iterations of T. There 
is another invariant curve passing through the fixed point, points of which 
move toward the fixed point under iterations of 7. Aside from the case where 
pi Or pe are +1, the above classification of stability is exhaustive. 


5. The Fixed Point Equation 


A fruitful tool in the investigation of the transformation, 7’, involves the use 
of the vector field in the (x, y) plane where from each point, Po , there emanates 
a vector directed toward and terminating in TP) = P,. The application we 
shall make here is a special case of results of Birkhoff.” : 

Clearly the number of revolutions made by PoP; as Po traces out a closed curve 
in the (x, y) plane must be an integer since Py returns to its starting position. 
This integer is called the index of the curve. The curve, we assume does not 
pass through any fixed points of 7. 

We shall first show that the closed curve which is the boundary of Ko, the 
continuum introduced in the proof of Theorem II, has index 1. Clearly this 
curve is free of fixed points. Since 7K» C Ky, the index would certainly be 1 
if Ko were the interior of a circle. For in this case the vector P,P, must make 
an angle of less than 90° with the radius from the center of the circle to Py , and 
the radius makes one revolution when Py does. 

We now consider the general case of Ky and gradually deform it into the 
interior of a circle. It follows that since the index of the boundary changes 
continuously with the deformation and since it is an integer, it must be 1. 

(We assume in what follows that there is no curve in the (x, y) plane each point 
of which is invariant under T.) We enclose each fixed point by a small circle. 
We join the circles together by curves so that if the curves be regarded as cuts, 
the curves and circles together form a single closed curve. Shrink the boundary 
of Ky down to this closed curve. Since the index will change continuously, it 
must remain 1 for the single closed curve made up of the circles and several 
curves joining them. In determining the index of the single closed curve, each 
curve segment, or cut, joining a pair of circles is traversed first in one direction 
and then the other. Thus the net effect of these segments on the index is zero. 


10 J, HaDAMARD, Sur l’iteration et les solutions asymptotiques des equations différentielles. 
Bull. de la Soc. math. de France, Vol. 26, 1901. Actually there is an invariant curve if 
pi and pz are real and unequal. 

11 Dynamical systems with two degrees of freedom. Trans. of the Am. Math. Soc., vol. 18 
(1917), p. 287. The method is due to Potncar&, Collected Works, Vol. 1, p. 25. 
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Therefore the sum of the indices of the several circles enclosing the fixed points of T 


must be 1. 
Let Po be a point on the circumference of such a small circle. Let ¢ be the 


angle PoP; makes with the z axis, or what is the same thing, the u axis. Then 
XH v1 — 
or using (4.2) 


Clo + (d — + 


If the radius of the small circle is r and if the angle the radius to Py) makes with 
the u axis is 6, then using (5.0) we get 


-1¢ cos 6 + (d — 1) sind + 7[---] 
(5.1) (a — 1) cos 6+ bsinéd + 


From (5.1) 


(5.2) (a — 1)\(d — 1) — be + 
dé [(@ — 1) cos0+ bsind + + [cos6 + (d — 1) 


Since we have excluded the case where (4.4) has roots equal to 1, 
(a — — bc ¥ 0. 


Thus it is clear from (5.2) that do/d@ is of fixed sign which it takes 
from (a — 1)(d — 1) — be. Also the denominator of the right member of 
(5.1) vanishes twice as 6 goes from 0 to 27. Since ¢ is either monotonically 
increasing or decreasing, this means that ¢ changes by 2x or —2x according to 
the sign of (a — 1)(d — 1) — bc when P) traverses the circle. In other words 
each circle enclosing a fixed point is of index equal to [(a — 1)(d — 1) — be]. 
A consideration of the relationship of this sign to the location of the roots of 
(a — p)(d — p) — be reveals immediately that in the cases completely stable, 
completely unstable, inversely unstable, and undetermined, (a — 1)(d — 1) — 
be > 0. In the case directly unstable, (a — 1)(d — 1) — be < 0. Since the 


sum of the indices is 1, we have 
TuroreM III. If C denote the number of completely stable, completely unstable, 


and undetermined points of T,, if I denote the number of inversely unstable points, 
and D the number of directly unstable points we have 


(5.3) 


Although (5.3) was proved for 7’, the proof would obviously apply to the 
transformation 7” where n is any integer. Thus (5.3) applies to the solutions 
of (1.2) of period nL. 
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6. A Theorem on Subharmonics 


A solution of (1.2) is called subharmonic if its least period is equal to gZ where 
q > 1isaninteger. We shall now show, for any gq, that not all the subharmonic 
solutions of least period of gZ can be stable. To be more specific we shall show 

TuroreM IV. If (1.2) is of class D and possesses N subharmonic solutions 
of least period qL, then N = 2kq, where k is an integer. Moreover kq, that is one 
half, of the solutions are directly unstable. 

Let A(m) represent the number of completely stable, completely unstable, 
inversely unstable, and undetermined solutions of (1.2) of least period mL, and 
let D(m) represent the number of directly unstable solutions of least period mL. 

Clearly if m, is a factor of m, then the solutions of (1.2) of period mL appear 
among those of period mL. In other words among the fixed points of T” are 
those of T™. Let 


where p;,j = 1, 2, +--+, , are prime numbers. Then the total number of solu- 
tions of (1.2) of period mL is given by e 


m1 


71=0 in=0 


Applying (5.3) to the transformation 7” we find 


(6.0) [A(pitpi? pit) — D(pitpi? --- pi] = 1. 

If we now apply (6.0) to the case where m = p;* ‘pz? --- pn and subtract the 
equation so obtained from (6.0), we get 


2=0 


If we consider (6.1) for m = pi'pz?"p3* --- pn" and subtract the result so 
obtained from (6.1) we get 


Proceeding in this way we finally get 
A(pi'p2* pn®) = D(pi*p2? - pr”). 
In other words 
(6.3)  A(m) = D(m). 


Let D(q) be the number of directly unstable fixed points under 7 but not fixed 
for T” where n > 0 is less than g. Consider such a fixed point P. Then P, 
TP, T’P, --- T* gp: 4 are all distinct. Thus the fixed points under consideration 


a 
(7 
( 


a 


| 
| 
| | 
= n3 nin | 
| 
a 


NON-LINEAR DIFFERENTIAL EQUATIONS 733 


fall into mutually exclusive sets of g points. That is D(q) = kq where k is an 
integer. Now from (6.3) it follows that A(qg) = D(q). Thus the total number 
of points fixed under 7" and not 7", 0 <n < q, is D(q) + A(q) = 2kg. This 
completes the proof of the theorem. 
7. Some Examples 

The simplest possible maximum finite invariant domain is that consisting of 
a single point. This is actually the case for the following two equations”, 
(7.0) and (7.1): 
(7.0) + + = eft) 


where e(/) is periodic, zf(x) > 0 for x ¥ 0, and / f(x) dx = ~; 


(7.1) + F(@) + = 
where F(y) > 0 for y ~ Oand F(y) > ~ asy—> 2. 
The differential equation 
+ f(x, + g(x) = elt), 
under the conditions enumerated in the paper referred to in connection with 
this equation in §1, and with the further condition 


7.2) f(x, y) + >0, (x0, 


has its maximum finite invariant domain of zero area. To see this we note that 
the left member of (7.2) is —(@F/dx + aG/ay) in our general notation. Thus 
by (4.1) the Jacobian of 7 under these conditions is always less than 1. Using 
the notation in the proof of Theorem II we note that I is the inner limiting 
set of the domains enclosed by the curves K,.. Denoting by Ao the area bounded 


by Ko, 
(73) area of J = lim Jf (re, + a]. 


But by (7.2), for all (a, ye in Ky there exists a 6 > 0 such that 
+2 at = 6. 


Thus (7.3) becomes 
area of J < lim Stall = 0. 


“%N. Levinson. On a Non-Linear Differential Equation of the Second Order. Journal 
of Math. and Physics, Vol. 22 (1943), p. 181. 
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The statement that J is of zero area clearly implies that every point of J is a 
limit point of the open continuum exterior to 7. This case includes the impor- 
tant case of 


e& + g(x) = e(d), e>0. 


Being of zero areas, J is of a comparatively simple character and a number of 
special results can be given. For the present we point out a few simple possible 
types of IJ of zero area. The case where J is a single point, which arises for 
instance with equations (7.0) or (7.1), signifies that there exists one periodic 
solution of period L toward which all other solutions tend. 

The case where J is a segment of a curve with three fixed points is of con- 
siderable interest. Let J consist of the curve segment ABC where A and C 
are the end points of the segment and A, B, and C are all fixed under T. A 
and C are completely stable fixed points, and B is directly unstable. ABC is 


shown in Fig. 1. 


A 


Fie. 1 


In case parameters in the differential equation are varied, ABC will change 
its shape, but we shall suppose ABC remains topologically equivalent to a line 
segment. This turns out to be the case in many situations of practical import- 
ance. What does happen though, very often, is that as the parameters in the 
differential equation change B will move up gradually to the point C, and will 
actually come into coincidence with C. Then as the parameters are varied 
further, I will consist merely of the point A. 

If we consider the periodic solution corresponding to the fixed point C, what 
we find is that this solution remains stable until B comes into coincidence with 
C, whereupon the only stable solution is the one corresponding to A. This 
situation is well known in practice as the “jump” phenomenon whereby a non- 
linear system may with a small change in parameter jump from one steady state 
to another. 

Another J of interest is shown in Fig. 2. Here 0 is the only fixed point under 
T and is completely stable. We have TA = B, TB = C, TC = A and similarly 
T’D = T’9E = TF = D. That is A, B, C, D, E, and F are fixed under 7”. 
Moreover under 7°, A, B, and C are completely stable and D, E, and F directly 
unstable. In this case the rotation number of J is 27/3. By Theorem IV we 
see that this case corresponds to the simplest J for which we can have a sub- 
harmonic of period 3L. 

Another J of interest is shown in Fig. 3. Here the portion of J from O through 
D continues on by wrapping itself an infinite number of times around the curve 
segment containing A. The same is true of the two other branches of I going 
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through EZ and F. As before TA = B, TB = C, TC = A and similarly for D, 
f, and F. We also have TG = H, TH = 1, TI = J, TJ = K, TK = L, and 
TL = G. That is, these points are fixed under T°. Thus in this case A, B, 
and C are inversely unstable under T* and D, E, and F are directly unstable 
under T°. Here although the rotation number is 21/3, we have no stable third 


2 


order subharmonics. However corresponding to G we will have a stable sub- 


harmonic of order 6. pi 
Another case, a singular case which we cannot rule out as a possibility, has an 


I which has only one fixed point, 0, under 7’ or any iterate of T’. Topologically 
I is equivalent to certain radii of a circle with 0 as its center. These radii 
emanate from 0 and terminate in a perfect no-where dense set of points on the 


circumference of this circle. A solution corresponding to a point of this perfect 
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set is of the type Birkhoff calls discontinuous recurrent. It would be very 
desirable to have a criterion for ruling out this type of J. 
As a final example let us take the van der Pol equation with a forcing term 


u(x” — +2 = ed), 


with » large. In this case J is not of zero area. If e(t) is not too large, and if 
the period of e(t) is likewise not too large, there is only one fixed point, 0, under 
T. Moreover 0 is completely unstable. Furthermore if we consider a circle, 
So , of small radius about 0, then the open simply connected domain, Sp + T'S) + 
T’S) +--+, is invariant under T. J minus this open domain is a “curve” 
every point of which is a limit point of the open domain exterior to the curve 
or the open domain interiot to the curve. As we have already stated, we can 
unfortunately not say anything about the analytic character of such a curve. 
This curve could conceivably be as bad as the curve, J, in the example of Birkhoff 
already cited. 


8. Completely stable invariant closed curve 


A “curve,” J, is an invariant completely stable closed curve under T' if (a) 
it divides the plane into two open simply connected invariant continua, S; and 
S. where to S, is adjoined the point at infinity; (b) every point of J is a limit 
point of S; or of S, or of both; and (c) for any small ¢ there exists an open con- 
tinuum, 0(e) containing J and with distance from J less than e and such that 


lim T"(O(e)) = J. 

A curve is said to be an invariant completely unstable closed curve if it is an 
invariant completely stable closed curve under T™*. Using occluded sets much 
as in the proof of Theorem II, it is easy to show that an invariant completely 
stable closed curve is contained in an open set Oy , arbitrarily close to J, such 
that Oo > TO, > TO; etc. Moreover O) can be chosen so that its exterior 
and interior boundaries are simple closed curves. 

The curve, J, considered in the last example of the preceding section, is clearly 
an invariant completely stable closed curve. 

We shall now prove 

THEOREM V. J is an invariant completely stable (or unstable) closed curve. 
If J contains N fixed points under T*, q = 1, but not fixed under T’,0 <j <4, 
then N = 2kq where k is an integer. Moreover kq of the fixed points are directly 
unstable. 

First consider the open simply connected domain consisting of Oo , described 
above, and all points in S; , the interior of J. Let Ao(q) be the number of com- 
pletely stable, completely unstable, inversely unstable and undetermined points 
invariant under 7", and no smaller power of 7, and contained in the open simply 
connected domain under consideration. Similarly Do(q) is the number of di- 
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rectly unstable points. Then using exactly the same procedure as in the proofs 
of Theorem III and IV, we have 


Ao(1) = 1 + Do(1) 
Ao(q) = Do(Q), q>l. 


Consider next the open simply connected domain bounded by the interior 
boundary of Oo. Call the invariant points for this domain A;(q) and D,(q), 
all these points being fixed under 7 but not under any smaller power of 7. Now 
applying 7” * we get as before 


A,(1) = 1 + Di(1) 
Aig) = Dig), 
Subtracting (8.1) from (8.0) we have for any q 
A(q) = D@) 


where A(q) and D(q) are the invariant points of T%, but not T,,0 <k <4q, 
contained in J. This result, as with (5.3) in the proof of Theorem IV, leads to 
statement of Theorem V. 


(8.0) 


(8.1) 
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I COMPLETENESS THEOREMS OF PALEY-WIENER TYPE 


By Harry 
(Received December 14, 1943, revised January 14, 1944) 


| he Various writers have obtained criteria for the completeness of a set {y,} in 
i Hilbert space H, ((1], [3], [5, 100], [6]). These state that if {y,} is “near” a 
| complete set {z,} in some suitable sense then {y,} is complete. In this paper 
| we contribute a further result of this character, and discuss some corollaries, 


Our principal criterion is the following. 

TuHEoreM 1.1. Let {rn}, {yn} be sequences in H. Suppose that for fixed 
Mm, < 1,0 S < 1 the following inequalities are satisfied by every 
finite sequence of numbers {an}: 


(1.1) || Zan(an — Yn) |? Zante + 


Then if {x,} ts complete so is {yn}. 

For \: = 0 this result is due to Paley and Wiener [5, 100], the form to Boas 
(2, 469]. It should be remarked that while our restrictions are less severe due to 
the presence of the A» term, our conclusion is correspondingly weaker. For 
Paley and Wiener obtain also an expansion theory, and this we lack. 

To prove our theorem we may suppose that \1 = A, = A, since we may re- 
place Ai and A, by = max (Ai, Ax) < 1. We shall show that (y,, z) = 0, 
nm = 1, 2;--- implies z = 0. Since {z,} is complete there exist coefficients an, 
such that 


z = lim 


the sums are finite. Let = >> dnstn, We = , noting that (z, wx) = 0 
for all k. By (1.1) and Schwarz’s inequality | 

= 2R(z, we) = 2R(z, — z, we) S 2 || ze — || well, 


whence 
Then 0 || w: || < Letting k it follows that || ws || 0. 


Passage to the limit oo 20 ) in (1.2) yields 0 < || z || < 0, whence z = 0. 
2. Further criteria 


Note that under the hypotheses of Theorem 1.1 the sets {x,}, {yn} are both 
complete or both incomplete. The following criterion can also be deduced from 
the main theorem. 
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TurorEM 2.1. Jf the inequalities 
(2.1) || — Ym) || S Zante || + |] Zanyn ||} 


we satisfied by every finite sequence {a,} for a fixed  < 34/2, then both sets are 
complete or both incomplete. 
Proor: The inequality (A + B)* < 2(A® + B’) shows that (2.1) implies 


|| — yn) ||? S |? + || ||). 


If 2)” < 1 then (1.1) holds with \; = \» = 2d” < 1; and the sets {z,} and {y,} 
are both complete or both incomplete.’ 
Of particular interest is the result obtained from Theorem 1.1 if we require 
that {z,} and {y,} both be orthonormal sets (denoted hereafter by ON). 
TuzorEM 2.2. Let {xn}, {yn} be two ON sets in H such that the inequalities 


5 Yn) ay |? 


hold for all finite sets {an} and a fixed » > 0. Then both are complete or both 
incomplete. 


3. The bounds on ); , Az 


It is clear that the value \; = 1 is not admissible in Theorem 1.1; for let {y,} 
be the sequence {0, 0, ---}. 

Now let z, be any complete ON set. Define 2, = Zn41 — 2ny Yn = 2Zn41- 
Then {z,} is complete, {y,} incomplete, and 


(3.1) || — yn) ||? || 


This proves that 2 = 1 is inadmissible. (This example was suggested by a 
similar one in [3, 852-3]). 

The following questions remain open. (i) Is it true that for every complete 
set {z,} there exists an incomplete set {y,} such that (3.1) is true? (ii) Are the 
bounds on A and yw in Theorems 2.1 and 2.2 best possible? 
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THE ANALOGUE OF BROMWICH’S THEOREM FOR INTEGRAL 
TRANSFORMATIONS 


By H. L. GaRnaBEDIAN 
(Received December 23, 1943; Revised April 7, 1944) 


1. Introduction 


In essence, Bromwich’s theorem for matrix transformations [1] provides suf- 
ficient conditions in order that a method of summation with infinite matrix of 
reference shall include Cesaro summability, a method of summation with finite 
matrix of reference. Various additional proofs of this theorem may be found 
in the mathematical literature, the most recent one by Garabedian [2], and 
also a number of applications, notably by Moore [3], Morse [4], and Garabedian 
[5]. 

This paper involves transformations of the type 


(1.1) it) [ t)s(t) dt, 


which associate with certain functions s(t) the functions o(¢) determined by a 
given kernel K(x, t). Restrictions on s(¢) and K(z, t) for the present investiga- 
tion will be specified later in this section. We shall also be concerned with trans- 
formations of the type 


(1.2) a(x) = t)s(d) dé, 


obtainable directly from (1.1) in the case that K(x, t) = 0,t > 2x. The trans- 
formations (1.1) and (1.2) (or the kernels in (1.1) and (1.2)) are said to be of 
infinite reference and finite reference respectively. An integral transformation 
or its kernel is said to be regular if lim,_,,, s(x) implies the existence of limz.,,, o(x) 
and the equality of the two limits. 

In this paper it will be of convenience also to use transformations of the type 


(1.3) k(x, ta(t) dt, 


where s(#) = [ a(u) du. 


We follow the procedure of Knopp [6] in setting down agp a on s(t) 
and K(z, t). 


Let S be the class of functions which satisfy the following three eon: 
(i) s(t) ts defined for t > 0, 
(ii) s(t) is bounded for 0 < t S a, for every a > 0, 
(ili) s(t) as Lebesgue integrable on the interval (0, a), for every a > 0. 
740 
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We designate by S, the subclass of functions in S for which s(t) is bounded, 


{> 0. 
let K(x, t) be defined and measurable in the quadrant Q: (x > 0, t > 0). 


Then we require that 
af xe» | dt shall exist for almost all x and belong to Sy ; in particular 


|dt< M, 
M independent of z, 
tim [| KG, |at = 0, 
for every fixed a > 0, 
(iii) lim K(a, t) dt = 1. 


Knopp [6] has proved that the conditions (1.4) ensure the regularity of the 
transformation (1.1). In this connection it is understood that s(t) belongs 
toS. For this paper we do not require conditions of regularity for transforma- 
tions of the type (1.3). 

The regular transformation 


which defines Cesaro summability (C, n), n > 0, for integral transforms, is of 
particular interest in this paper. 

It is the main object of this paper to establish the analogue of Bromwich’s 
theorem for integral transformations, that is, to provide sufficient conditions 
in order that a method of summation defined by a transformation of the type 
(1.3) shall include Cesaro summability of positive integral order. The proof 
is given in §2. Bromwich [1] has established a theorem resembling the theorem 
of §2, but only for the case of (C, 1) summability and with less generality and 
simplicity than achieved by this writer. In §3 a kernel k(x, ¢) which fulfills 
the conditions of the theorem of §2 is exhibited. In §4 the connection of the 
Bromwich theorem for integral transformations with the problem of the flow 
of heat in an infinite rod whose surface is a non-conductor is discussed. 


2. The analogue of the Bromwich theorem for integral transformations 


This section is devoted to the statement and the proof of the analogue of the 
Bromwich theorem for integral transformations. 


THEoREM 2.1. Let the integral a(u) du, a(u) in S, be summable (C, n), 
for some positive integral value of n, to the value 1. Let k(x, t), together with its first 
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n + 1 derivatives, be defined and measurable in the quadrant Q: (x > 0, t > 0). 
Let K(x, t) = (—1)""#"(a"*"*/at"™)k(z, t)/T'(n + 1) and require that 


(i) [ ike, t) | dt shall exist for almost all x and belong to S, ; in particular, 


[ixeola<m, 


M independent of x, 
tim [| |at =0, 
for every fixed a > 0, 
(iii) lim is K(a, t) dt = 1, 
(iv) lim K(x, ) = 0, 
Then the integral 
k(x, #)a(t) dt 
exists for almost all x, and . 


lim k(x, t)a(t) dt = 
We are concerned with the Cesaro transformation 
(2.1) é"y(t) = [ (t — u)"a(u) du, 
and the transformation 
(2.2) 2(z) = [ k(x, t)a(t) dt. 


We wish to determine conditions on the kernel k(z, ¢) so that lim,.,, y(x) = | 
shall imply lim,.,,, z(z) = J. In this connection the problem of expressing z 
in terms of y presents itself. We do this formally at first, making all necessary 
assumptions to determine the form of the required transformation. Our method 
of formal procedure will be to solve (2.1) for a in terms of y and then‘substitute 
in (2.2), obtaining z in terms of y. 

To solve (2.1) for a in terms of y we observe first of all that the expression 


[ (¢ — u)"a(u) du/T(n + 1) is the (n + 1)** integral of a(u) in the Liouville- 
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Riemann theory of fractional integration. Accordingly, we have 
(23) = Tn + lal), 


this formula being valid for almost all ¢. 
Now, eliminating a between (2.2) and (2.3), we have 


1 
2(x) = rin +1 [ k(x, )D?**{t" y(t)} de. 


Integrating by parts in this ioe we get 


on the assumption that k(x, t)D?{t”y(t)} vanishes ast—>0 and ast— ~. Fi- 
nally, after other assumptions of this nature and after n additional integrations 


by parts, we obtain 


art 

It is clear that this is not a valid transformation from y to z unless we impose 
restrictions of unwarranted severity on the class of functions a which we are 
considering. Accordingly, we shall look for a means of obtaining (2.4) directly 
without further limiting the class of functions ain S. To this end we eliminate 
y between (2.1) and (2.4) and attempt to derive (2.2) without adding to the 
orginal restrictions on a. Thus, we have at the outset 


n+1 n+1 
@5) = [ (t — u)"a(u) du dt. 


The integral (2.4), and consequently (2.5), exists by virtue of condition (i) of 
Theorem 2.1 and the boundedness and integrability of y. Then, we enlist the 
aid of the theorem of Fubini [7] to justify an sie ete in the order of integra- 


tion in (2.5) to obtain 


(2.6) ate) [ (t —u)” t) dt du. 


+ 1) 
We wish now to prove that 
n+1 n+1 


Using condition (iv) of Theorem 2.1 and integrating by parts n times in the 
left member of (2.7) we get 


Tin + 1) [ (t u) k(a, t) dt = fé t) dt = u). 
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From this relationship and (2.6) we have 
(2.8) 2(x) = [ k(az, u)a(u) du, 


which is identical with (2.2) and which we have shown to exist for almost all x. 
This establishes the transformation (2.4) from z to y without change of our 
original hypotheses. 

It remains to prove that lim,.,, z(z) = 1. To this end we need merely to re- 
quire that (2.4) be a regular transformation, since lim,.,,, y(z) = 1 by assumption. 
The requirements (1.4) for regularity of a transformation of the type (2.4) 
are fulfilled by virtue of conditions (i), (ii), and (iii) of our theorem. Our 
theorem is then established. 


3. A kernel which fulfills the conditions of Theorem 2.1 


In this section we show that the kernel k(x, t) = e '* fulfills the conditions 
of Theorem 2.1. This kernel when associated with the transformation (1.3) 
generates an integral transformation which is the analogue of one of the 
Dirichlet’s series transformations [5] in the field of matrix transformations. The 
kernel k(z, t) = ¢€* provides the analogue of the Abel transformation in the 
. field of matrix Theainrrsa It can be shown that the Abel kernel also 
fulfills the conditions of Theorem 2.1, but the proof is omitted here. 

In order to check the conditions of Theorem 2.1 for the kernel k(x, t) = ae 
we need the formula 


= (—1)"H,(2e*, 
where H,(z) is the n“ Hermite polynomial. Thus, we have 


and consequently 
t) nl aH i) = (tx ) Je 
Now, we have 
a az—1/2 


whence we conclude that lim,.,,, I | K(a, t)|dt = 0. Accordingly, condition 


(ii) of Theorem 2.1 is fulfilled. 
From (3.2) we have 


[ uu" | | du, 
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Now, | Ha+a(u) | S Pai(u), where P,4:(u) is H,:(u) with all coefficients posi- 


tive. Since [ u"Pralue™ du exists and is independent of x, we conclude 


that [ | K(z, t) | dt < M, M independent of x, and hence that condition (i) 
is satisfied. 

With the aid of (3.1) it is clear that condition (iv) is fulfilled. 

Finally, we consider 


1 n —u2 (-1)"" n +1 —u? 
[ du = [ wo: 


and observe that n integrations by parts transforms the last integral into 
- [ d(é“") = 1. This proves that (iii) holds and completes the discussion of 
this section. 
4, Application to the rigorous solution of the problem of the flow of heat in an 
infinite rod 

We consider an infinite rod whose surface is a non-conductor, and we suppose 
that any cross section of the bar is so small that the temperature is constant in 
that cross section. We suppose further that the initial distribution of the tem- 
perature is given by the function f(x). It is required to determine the tem- 
perature of any point of the bar at any later instant. The differential equation 
dominating the unsteady flow of heat in one dimension is 


au _ 

ot dx?’ 

where u(x, ¢) is the temperature function. We seek a solution of the differential 

equation subject to the condition. 


(4.2) lim u(x, t) = f(z), 


(4.1) t>0, 


provided that f(x) is continuous. If we assume simply that f(x) is Lebesgue 
integrable on the interval (— ©, «), the limit in (4.2) is equal to [f(z + 0) + 
f(t — 0)] wherever this expression has a meaning; to f(x) wherever f(x) is con- 
tinuous; and to f(x) for almost all values of z. 

Using a classical formal procedure we find that the integral 


(43) [ [4@ cos + sin dg 

may afford a solution of the differential equation. This integral must tend to 
{(e), if f(z) is continuous, as t > 0. This raises the question of the possibility 
of expressing f(x) in the form 

(4.4) [ [4@ cos + sin dg. 
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The formal development of f(x) in terms of a Fourier integral is given by 
(4.4) where 


= [ 40) cos™ 
(4.5) 


BiB) 


. Br 
f(A) sin dy. 
Substitution of A and B, as given in (4.5), into (4.3) gives the expression 


(4.6) = cos (. — ar ap. 


Integration with respect to 8 in (4.6) identifies this integral with the classical 
solution of Poisson to the proposed problem. 

It is easy to show that the integral (4.6) converges for ¢ 2 to > 0, and all 
values of x, and defines in that region a continuous function w(x, t) which satis- 
fies the differential equation (4.1). In order to satisfy condition (4.2) we have 
merely to show that the kernel k(1/t, 8) = e *** satisfies the conditions of Theo- 
rem 2.1, since, by Fejér’s theorem [8, p. 29], the integral (4.4) is summable 
(C, 1) to 4[f(a + 0) + f(x — 0)] wherever this expression has a meaning; to f(z) 
wherever f(x) is continuous; and to f(x) for almost all values of x. Since, in §3, 
we proved that the kernel k(1/t, 8) = ~#** fulfills the conditions of Theorem 2.1 
we conclude that lim, u(x, ¢) is equal to 4[f(z + 0) + f(x — 0)] wherever this 
expression has a meaning; to f(x) wherever f(x) is continuous; and to f(x) for 
almost all values of 2. 

The problem under discussion in this section affords an interesting application 
of Theorem 2.1. It should, however, be noted that the solution is valid under 
much less restrictive conditions on f(x) at infinity than imposed by this writer 
[8, p. 31]. 
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A SIMPLE INTRINSIC PROOF OF THE GAUSS-BONNET FORMULA 
FOR CLOSED RIEMANNIAN MANIFOLDS 


By SHIING-sHEN CHERN 
(Received November 26, 1943) 


Introduction 


C. B. Allendoerfer’ and W. Fenchel” have independently given a generaliza- 
tion of the classical formula of Gauss-Bonnet to a closed orientable Riemannian 
manifold which can be imbedded in a euclidean space. Recently, Allendoerfer 
and André Weil’ extended the formula to a closed Riemannian polyhedron and 
proved in particular its validity in the case of a general closed Riemannian 
manifold. In their proof use is still made of the imbedding of a Riemannian 
cell in a euclidean space. The object of this paper is to offer a direct intrinsic 
proof of the formula by making use of the theory of vector fields in differentiable 
manifolds. 

The underlying idea of the present proof is very simple, so that a brief summary 
might be helpful. Let.” be a closed orientable Riemannian manifold of an 
even dimension n. According to details to be given below, we define in R” an 
intrinsic exterior differential form Q of degree n, which is of course equal to a 
scalar invariant of R" multiplied by the volume element. The formula of Gauss- 
Bonnet in question asserts that the integral of this differential form over R" is 
equal to the Euler-Poincaré characteristic x of R". To prove this we pass from 
the manifold R” to the manifold M°”* of 2n — 1 dimensions formed by the unit 
vectors of R”.* In M’""' we show that Q is equal to the exterior derivative of a 
differential form II of degree n — 1. By defining a continuous field of unit vec- 
tors over R” with isolated singular points, we get, as its image in M ent a sub- 
manifold V" of dimension n, and the integral of 2 over R" is equal to the same 
integral over V". The application of the theorem of Stokes shows that the 
latter is equal to the integral of II over the boundary of V". Now, the boundary 
of V" corresponds exactly to the singular points of the vector field defined in 
R", the sum of whose indices is, by a well-known theorem, equal to x. With 
such an interpretation the integral of II over the boundary of V" can be evaluated 
and is easily proved to be equal to x. 

The method can of course be applied to derive other formulas of the same 
type and, with suitable modifications, to deduce the Gauss-Bonnet formula for 
a Riemannian polyhedron. We publish this proof, because it is in the present 
case that the main ideas of our method are most clear. Further results will be 
given in a forthcoming paper. 


§1. Résumé of some fundamental formulas in Riemannian Geometry 


Let R" be a closed orientable differentiable manifold’ of an even dimension 
n = 2p and class r = 4. In R” suppose a Riemannian metric be defined, with 
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the fundamental tensor g;; , whose components we suppose to be of class 3. Since 
we are to deal with multiple integrals, it seems convenient to follow Cartan’s 
treatment of Riemannian Geometry,° with the theory of exterior differential 
forms, instead of the ordinary tensor analysis, playing the dominant réle. The 
differential forms which occur below are exterior differential forms. 

According to Cartan we attach to each point P of R” a set of n mutually per- 
pendicular unit vectors ¢:,--- , ¢,, With a certain orientation. Such a figure 
Pe, --+ , is called a frame. A vector v of the tangent space of R” at P can be 
referred to the frame at P, thus 


(1) v= 


where the index 7 runs from 1 to n and repeated indices imply summation. The 
law of infinitesimal displacement of tangent spaces, as defined by the parallelism 
of Levi-Civita, is given by equations of the form 


dex = wig + og = 0 


(2) 
where w;, w:; are Pfaffian forms. These Pfaffian forms satisfy the following 


“equations of structure’: 


te = 
= +23, = 0. 
In (8) Q;; are exterior quadratic differential forms and give the curvature proper- 
ties of the space. 

The forms Q;; satisfy a system of equations obtained by applying to (3) the 
theorem that the exterior derivatives of the left-hand members are zero. The 
equations are 


(3) 


(4) = 0, 


— wn Qn + = O, 
and are called the Bianchi identities. 
For the following it is useful to know how the 2;; behave when the frame ¢; - -- 
é, undergoes a proper orthogonal transformation. In a neighborhood of P in 
which the same system of coordinates is valid let e: --- e¢, be changed to eres: 
e, according to the proper orthogonal transformation: 


(5) = aizt; 
or 
(6) 


where (a;;) is a proper orthogonal matrix, whose elements a;; are functions of 
the coordinates. Suppose Q}; be formed from the frames Pe* --- es in the 
same way as Q;; are formed from Pe, ---¢,. Then we easily find 


(6) 03; = 
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From (6) we deduce an immediate consequence. Let ¢;,...;, be a symbol 
which is equal.to + 1 or — 1 according as 7, , --- , 7, form an even or odd per- 
mutation of 1, --- , m, and is otherwise zero. Since our space R” is of even di- 
mension n = 2p, we can construct the sum 


1 


(7) Q= (—1)"* 2? p! Cig Digig Qing » 


where each index runs from 1 ton. Using (6), we see that 2 remains invariant 
under a change of frame (5) and is therefore intrinsic. This intrinsic differential 
form Q is of degree n and is thus a multiple of w: ---w,. As the latter product 
(being the volume elernent of the space) is also intrinsic, we can write 


(8) Q = wa, 


where the coefficient J is a scalar invariant of the Riemannian manifold. 
With all these preparations we shall write the formula of Gauss-Bonnet in the 


following form 
(9) Q= x, 


x being the Euler-Poincaré characteristic of R”. 


§2. The space of unit vectors and a formula for 2 


From the Riemannian manifold R” we pass now to the manifold M*"™ of di- 
mension 2n — 1 formed by its unit vectors. M°"™ is a closed differentiable 
manifold of class r — 1. As its local coordinates we may of course take the 
local coordinates of R” and the components wu; of the vector v in (1), subjected 
to the condition 


(1’) uu = 1. 

If 6; ave the components of dv with respect to the frame @ -+- ¢, , we have 
(10) dv = 6:, 

where 

(11) 0; = dus + 

aia 

(12) ui; = 0. 

From (11) we get, by differentiation, 

(13) dO; = + - 


As to the effect of a change of frame (5) on the components 4; , 0; , it is evidently 
given by the equations 
(14) ul =aiju;, = aij. 
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We now construct the following two sets of differential forms: ' 
k=0,1,--+,p—1, | 
k =0,1,-+-,p—1. 
The forms 4 are of degree 2p — 1 and WY; of degree 2p, and we remark that 
W,-1 differs from @ only by a numerical factor. Using (6) and (14), we see that 
, and Y; are intrinsic and are therefore defined over the entire Riemannian 
manifold R”. 
We shall prove the following recurrent relation: 


2p — 2k — 1 


where we define ¥_, = 0. Using the property of skew-symmetry of the symbol 
€i,-.-ig, in its indices, we can write 


k=0,1,---,p—1, 


where «) is an abbreviation of ¢,...;,,. For the derivatives du;, d0;, dj; 
we can substitute their expressions from (11), (13), and (4). The resulting ex- 
pression for d®, wi'] then consist of terms of two kinds, those involving w;; and 
those not. We coliect the terms not involving w;;, which are 


This expression is obviously intrinsic. Its difference with d®, is an expression 
wh:.ch contains a factor w;; in each of its terms. 

We shall show that this difference is zero. In fact, let P be an arbitrary but 
fixed point of R”. In a neighborhood of P we can choose a family of frames 
such that at P, 


wis = 0. 
(This process is “‘equivalent” to the use of geodesic coordinates in tensor nota- 
tion.) Hence, for this particular family of frames, the expressions (18) and 
d#, are equal at P. It follows that they are identical, since both expressions are 
intrinsic and the point P is arbitrary. 
To transform the expression (18) we shall introduce the abbreviations 
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which are forms of degree 2p. Owing to the relations (1’) and (12) there are 
some simple relations between these forms and ¥,. In fact, we can write 


= — P, — 2p — k — 1)T, — 2kP,, 
which gives 
(20) = 2h + + 2p —k -1)7.. 
Again, we have | 
De = Uiy Us, Os, — Dig — Os, — — 
= T, — (2+ 
and hence 
(21) = 2k + 
The expression (18) for d®;, therefore becomes 
db, = Va + (2p — 2k —1){P. + k&k=0,1,---,p—1. 


Using (20) and (21), we get the desired formula (17). 
From (17) we can solve % in terms of d& , d®,, +--+, d&. The result is 
easily found to be 


+ Ik (k — m+ 1) 
(22) ve = (2p — 2k — 1)(2p — 2k 1)-- -(2p— 2k + 2m — 1) 


k=0,1,---,p—1. 
In particular, it follows that 2 is the exterior derivative of a ferm II: 


1 
(23) Q = (—1)"" pl = 
where 
1S 1 


§3. Proof of the Gauss-Bonnet formula 


Basing on the formula (24) we shall give a proof of the formula (9), under the 
assumption that R” is a closed orientable Riemannian manifold. 

We define in R” a continuous field of unit vectors with a point 0 of R” as 
the only singular point.’ By a well-known theorem the index of the field at 0 
is equal to x, the Euler-Poincaré characteristic of R”. This vector field defines 
in M*" a submanifold V", which has as boundary xZ, where Z is the (n — 1)- 
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dimensional cycle formed by all the unit vectors through 0. The integral of 
2 over R” is evidently equal to the same over V". Applying Stokes’s theorem, 
we get therefore 


From the definition of 4) we have 


The last sum is evidently the volume element of the (2p — 1)-dimensional unit 
sphere. Therefore 


2x” 


Substituting this into (25), we get the formula (9). 
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don THE TOPOLOGICAL THEORY OF FRECHET SURFACES! 


rem, 
By J. W. T. Younas 


(Received March 13, 1944) 


INTRODUCTION 


1. The term surface 


The object of this paper is the study of surfaces. As the title indicates we 
do not propose to consider all the mathematical entities to which the term 
surface has been applied, and, in fact, our first obligation is to define precisely 
the objects of our attention. 

In this connection even the most casual reader will have noticed a certain 
degree of confusion attached to the word surface as it is sometimes used in the 
literature. In fact, as with some other concepts, it is a comparatively recent Nig 
departure to offer a definition at all. 

If one begins with the premise that mathematical terminology should have a 
firm foundation in intuition it is perhaps difficult to justify the definition we 
shall offer, since a Fréchet surface is ultimately defined as a certain class of map- 

62 pings, and is not, strictly speaking, a geometrical object. Why then preserve e 
: the word surface in speaking of the classes we shall study? The reason is princi- » teh 

im pally historical. On the other hand, though it is quite true that the layman "hy 

an would hardly recognize the objects of our attention as surfaces, we propose to 
show that the definition is a natural consequence of a desire for rigor in the 
intuitive approach. 


init 


2. The geometric concept 


There can be no question but that the term surface was first applied to certain 
point sets. The study of geometry included the study of certain objects known f 
as surfaces. It was decided on the basis of intuition which objects were sur- 
faces and which were not. In a sense we have a catalogue of surfaces in the 
early history of the subject. 

It became generally accepted that a point set was a surface if it was in some 
sense flat at every point. These intuitive concepts were made rigorous with 
the advent of topology and the notion of two dimensional manifolds. 

So much for the geometric and terminological implications of the word. 
Around the time of Descartes, matters in a sense stood at the place where the 
body of available mathematicians passed judgment on each point set and pro- 


1 Presented to the American Mathematical Society in April of 1940, 1942 and 1943 under 
the title ‘On parametric representations of surfaces I, II and III.” The term Fréchet 


surface is due to a suggestion of Radé. 
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754 J. W. T. YOUNGS 
nounced it guilty or innocent of being a surface. It cannot be asserted that the 
introduction of analytic methods was an unadulterated blessing. 


3. The analytic approach 


An equation of the type 2° + y° + 2 = 1 determines a point set which was 
universally accepted as a surface. The equation is a means of defining the 
surface. From this there was probably a gradual transition to speaking of the 
surface x + y’? + 2° = 1. Gradually that which was employed as a description 
of an object came to replace the object itself. 

This alone led to no serious confusion, but there came a day in which one be- 
gan to speak of two equations, which to all appearances were entirely dissimilar, 
as defining the same surface. It would be an interesting but no doubt impossible 
piece of historical research to discover at just what juncture this occurred. 
Certainly the idea was well entrenched by the time Darboux wrote his treatise 
on surfaces. There he speaks of a surface being defined in two ways—para- 
metrically and non-parametrically. 

Suppose one were given two sets of continuous functions 


xz = x(u, v) a = £(u, v) 
(a) y=y(u,v) and (b) y = nu, ») 
z = 2(u, v) z= ¢(u, v) 


where in each case the point (u, v) is in the unit square X:0 SuS1,0SvS81. 

The following array of questions presented itself. I. Does the triple of func- 
tions (a) define a surface, or, what was presumably the same question, is the 
triple a surface? II. Is the triple (b) a surface? III. If both questions are 
answered in the affirmative, are the surfaces the same? 

To decide the matter, a student would have to consult the point sets deter- 
mined by the triples and decide by consulting some catalogue whether these 
were surfaces or not. Then by some standard he would have to judge whether 
the surfaces were the same. The standard of judgment was probably never so 
lax as to allow one to say that the surfaces were the same merely if the point 
sets were identical. The surfaces were usually called the same if one could 
obtain the second triple from the first by suitable change of parameter. 

Faced with the decisions to be made in this matter it became the practice to 
eliminate a consideration of the point set—and so, apparently, to discard 
geometry and topology—and state that two triples did or did not define the 
same surface purely on the basis of a change of parameter. 


4, The notion of equivalence 


All that remains to be done is to trace the growth of rigor in the concept of 
two triples defining the same surface. 
For the sake of future convenience let us think of the triples (a) and (b) as 


mappings from X onto some point set Y in Z;. The points of X are designated 
by small Latin letters. 
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Whatever is to be meant by the statement that two mappings define the same 
surface several requirements should be satisfied. The mapping (a) should de- 
fine the same surface as the mapping (a). If the mapping (a) defines the same 
surface as the mapping (b) then the mapping (b) should define the same surface 
as (a). Finally if three mappings are so related that the first defines the same 
surface as the second, and the second as the third, then the first and third should 
define the same surface. In other words the relation “defines the same surface 
as” is reflexive, symmetric and transitive; it is an equivalence relation over the 
class of continuous mappings. 

In a very real sense it may be said that this paper is the study of such equiva- 
lence relations. 

There is nothing new about this point of view and comparable statements 
have appeared in the literature, though only in recent years. On the other hand 
there is nothing extraordinary about this since the general use of equivalence 
relations essentially had to await the development of modern algebra. 

Radé has recently called my attention to a definition of curves in the spirit 
of this section. It occurs in a paper by Kneser in the Mathematische Zeitschrift 
for 1926, volume 25, page 363. 

In this connection it can be argued that Lebesgue used an equivalence relation 
which we define below though the terminology of equivalence is certainly not his. 

The mapping fi(X) is Lebesgue equivalent to the mapping f2(X) if and only if 
there is a homeomorphism h(X) = X such that f,(z) = fo(h(x)) for x « X. 

There are other standards by which one can decide if two maps “define the 
same surface’ but before going into them it is perhaps well to note the limita- 
tions imposed upon the phrase in quotation marks. It certainly cannot be 
interpreted to mean that the concept of surface has been defined. 


5. Surface and representation 


The next step is inevitable. The collection of maps which “define the same 
surface” is an equivalence class; this equivalence class is defined to be a surface. 

Explicitly the situation is this. Suppose that we are given an equivalence 
relation ~ over the class F of mappings f(X). This equivalence partitions F 
into mutually exclusive equivalence classes [f]. Each class may be considered 
as having been generated as follows. If f; is a map then the collection of all 
maps f such that f ~ fi is the equivalence class [/i]. 

It follows that fi ¢ [f:] since the relation is reflexive; and f,; ~ fz implies that 
[fi] = [fe] since the relation is symmetric and transitive. 

Hence a surface is a concept relative to some equivalence relation. A surface 
relative to the equivalence relation ~ is an equivalence class [f]. If we use 
Lebesgue equivalence we obtain Lebesgue surfaces, if we vse an equivalence 
due to Fréchet we obtain Fréchet surfaces, if we use a Blank equivalence we ob- 
tain Blank surfaces. 

In the future we shall not say that two mappings define the same surface but 
that they generate or represent the same surface if the mappings are in the same 
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equivalence class. In line with this convention a representation of a surface [f] 
is a member of the equivalence class [/]. 

It will have been noticed that the set X, or range, is a fixed set-—the unit square. 
This has been selected in connection with the triples (a) and (b) of §3 for analytic 
convenience. Generalizations in two directions are possible and are in the litera- 
ture. The range X can be taken to be the closure of a region in the plane whose 
boundary is a finite number of Jordan curves. Or the range X can be a 2-sphere. 
In the event the range X were an arc, however, the equivalence classes would 
not be called surfaces, but curves. In the event the range X were a cube, the 
equivalence classes would be called solids. 

To make the matter explicit, the name of the equivalence class depends upon 
the topological character of the range. Thus it is customary to speak of sur- 
faces of the topological type of a 2-sphere, meaning that the range is a 2-sphere, 
or surfaces of the topological type of a torus, meaning that the range is a torus. 

Another generalization should be mentioned. The statement that [f] is a 
Blank surface of the topological type of the range X means that all the mappings 
in the Blank equivalence class [f] are mappings with the fixed range X. The 
general Blank equivalence, however, will invariably have meaning as long as the 
mappings are from ranges X; and X¢ of the same topological type. For example, 
it is possible to say that the mapping f;(X,) is Lebesgue equivalent to the map- 
ping fo(X2) if and only if there is a homeomorphism h(X,) = Xz such that 
filz') = folh(x')), for 


6. Fréchet surface 


This paper is mainly concerned with Fréchet surfaces. In other words its 
principal object is the examination of Fréchet equivalence which we now propose 
to define. 

The mapping f,(X1) is Fréchet equivalent to the mapping fe(X2) if and only 
if for every positive ¢ there is a homeomorphism h,.(X:) = X2 such that p{fi(z’), 
fa(h.(x'))} < efor ¢ 

The notion of Fréchet equivalence is not as simple as the notion of Lebesgue 
equivalence. The latter may be thought of as an exact matching of the maps 
by means of a homeomorphism while the former is a matching within a con- 
trolled error of «. 

It is not difficult to construct examples to show that two mappings which are 
Fréchet equivalent are not, in general, Lebesgue equivalent. On the other 
hand it is quite obvious that Lebesgue equivalence implies Fréchet equivalence. 


7. The main problem 


The collection of maps which are Lebesgue equivalent to a given mapping 
J(X1) is easy to visualize. It is simply the collection of mappings g(X:) = 
f(h(X2)) where h(X2) = X, is a homeomorphism from X2 onto X;. The collec- 
tion of mappings which are Fréchet equivalent to f(X;) contains all those given 
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e [f] by the formula above, and others. It is not so easy to see how the others are 


constructed. 
are, This brings us to what is perhaps the fundamental problem of the paper. 


ytic Given one representation f of a Fréchet surface [f| find all the other representations. 
_ Ta. An alternate point of view 


ose 
pre. This introduction would not be complete without the addition of some com- 
uld ments regarding a different approach to the concept of surface. These remarks | 
the are the consequence of a conversation with Rado. 
In precisely the same fashion as a physicist uses mathematical models to corre- . 
on spond to, but not replace, physical phenomena, a mathematician constructs 
ur- mathematical models corresponding to mathematical phenomena. For example, ; 
re, the real number system has been used for centuries in an intuitive fashion and is 
US, so used today by many people who rely on mathematics. Thus the real number 
8 system as used for centuries is a mathematical phenomenon for which only re- 
gs cently mathematicians have constructed mathematical models. A second ex- - 
he ample is the intuitive concept of connectedness, a mathematical phenomenon 
he for which Lennes constructed a mathematical model. In all such cases the 
le, mathematical model is abstract and is to serve the expert, not the layman. 
p- Similarly, it can be argued, the words curve and surface were used intuitively ; 
at in geometry applied to figures, in analysis, to equations. A mathematical model 
for the geometric type of usage is found in the concept of oneand twodimensional 
manifolds. A mathematical model for the second type of usage is the concept ' 
of Fréchet curves and surfaces as employed in this paper. | | 
e Cuapter II 
y TOPOLOGICAL PRELIMINARIES 
, This chapter is concerned with a short statement of some of the topological 
concepts required in the body of the paper. In a general way we will assume a 
e familiarity with Whyburn [10] for the point set theoretic portions, and Alexan- 


droff-Hopf [2] for the combinatorial theory. 1 


8. Point set topology 
The list of theorems is not intended to be exhaustive and to save space only 
| the key propositions are mentioned. 
All spaces will be metric and any mapping is understood to be continuous. 
THEoREM 8.1. If f(X) = Y is a biunique mapping and X is compact then it 
ts a homeomorphism. 
THEOREM 8.2. (Eilenberg-Whyburn Factor Theorem). Jf X is compact 
then any mapping f(X) = Y can be factored, 


f(X) = lm(X) 
where m(X) = = is monotone and (2) = Y is light. 


| 
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THEorEM 8.3. In order that the mapping l(X) = Y be light it is necessary and | 


sufficient that for every « > 0 there is ad > 0 such that if C is a continuum of diameter 
< 5 in Y then any component of f(C) has a diameter < «. 

In connection with Peano space theory, where Kuratowski-Whyburn [11] is 
sufficient for our purposes, we need the following facts. 

TuroreM 8.4. The true cyclic elements of a Peano space form a null sequence. 

TuroreM 8.5. Jf A is an A-set in a Peano space X then the components of 
X — A form a null sequence and each has a single point for frontier. 

Suppose that A is an A-set, and consider the mapping 


xifzeA 


F(G) if x eG a component of X — A 


This mapping is a monotone retraction of X onto A and is of fundamental 
importance. 

THEOREM 8.6. For the Peano space X there is a sequence of cyclic chains 
C(Pn, Qn) such thai 

1) UfC(p;, qi) = An is an A-set for each n. 

2) = for each n. 

3) UA, = H is dense in X 

4) If d, is the supremum of ihe diameters of the components of X — A,, then 
d, — 0. 

5) d{C(pn, qn)} — 9. 

We are primarily interested in special Peano spaces—-notably 2-spheres. It 
is convenient to have a topological characterization of the 2-sphere and we 
mention a beautiful theorem of Kuratowski: a 2-sphere is a non-degenerate cyclic 
Peano space whose non-cutting continua are unicoherent. By means of this and 
standard properties of monotone maps it is easily shown that: 

THEOREM 8.7. The monotone image of a 2-sphere is a cactoid; i.¢., a Peano space 
whose true cyclic elements are 2-spheres. 

THEOREM 8.8. Any cactoid is the monotone image of a 2-sphere. 

Nothing has been done, apparently, with upper semi-continuous collections 
in this discussion, but their usefulness has been implied everywhere. We will 
encounter them in the future. Before concluding this paragraph we should 
mention an often applied theorem of Janisewski. . 

THEOREM 8.9. Jf {C,} is a sequence of continua in a Peano space X and 
lim inf C, # 0, then lim sup C, is a continuum. 


9. Combinatorial topology 
The principal concept here is that of orientation and degree of a mapping 
(Alexandroff-Hopf [2]). 
We shall be interested mainly in the degree of a mapping from an oriented 
2-sphere onto another. If f is the mapping the notation employed is Dgr f. 
TurorEM 9.1. If X, and Xz are oriented 2-spheres, f,(X:1) = while f.(X:) 
converges uniformly to f(X1), then for sufficienily large n, Dgr f, = Der f. 
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TazorEM 9.2. If h(X1) = Xz is a homeomorphism of an oriented 2-sphere into 
another, then Der hh = + 1. 

If J; is a (closed) 2-cell on the oriented 2-sphere X;, i = 1, 2, and we have a 
homeomorphism h(J;) = Jz, then we can speak of the degree of h in terms of 
the orientations on X,; and X, since these orientations induce orientations on 
J,and Je. 

THEOREM 9.3. Suppose that Ji, +--+ , Jin are 2-cells on an oriented 2-sphere 
=0,k = 1,2. If there is a homeomorphism fi(Ju) = Jx 
of degree 1 (—1) for k = 1, ---, n, then there is a homeomorphism f{(X:) = Xz 
which agrees with f, on Jy, and has degree 1 (—1). 

This theorem is certainly well known though apparently not stated explicitly 
in the literature. It can be proved as a consequence of some results of Adkis- 
son and MacLane [1] on extensions of homeomorphisms. Some remarks will 
be made to indicate the connection. 

There are (n — 1) arcs a, +++ , Qn; such that 

1) For each k the are a, has the end point a, on Jy and the other end point 
b, on Jix41, but no other point on any of the J,’s. 

2) = 0, 

Let fi(ax) = pe and fesi(bx) = ge. Then there is a collection of arcs 
61, , Such that the have the same properties on as the a’s on X;. 

Let P; be the set of points in the 2-cells and ares of X;,7 = 1, 2. The sets 
P; and Pz are Peano spaces on the oriented 2-spheres X; and X2. 

Let = be a homeomorphism such that g(ai) = pe, g(be) = Qe, = 
1,---,(m — 1). Let h(P,) = Ps be a mapping which agrees with g on o, and 

fion for every possible and l. 

The mapping h is a homeomorphism which satisfies the triod condition of 
Adkisson and MacLane and the theorem follows. 


III 


Maps aND HOMEOMORPHISMS 


In this chapter it wili be our object to make a preliminary application of some 
of the topological tools mentioned. 

The final objective, roughly speaking, is to approximate certain monotone 
transformations uniformly by means of homeomorphisms. The monotone 
transformations in question are from one 2-sphere onto another. The degree 
of such mappings can then be calculated at once by 9.1 and 9.2. 


10. A modification theorem 


THEOREM 10.1. Jf m(X) = Y is a monotone mapping from the 2-sphere X 
onto the 2-sphere Y, and G is the interior of a 2-cell on Y while G* = m‘(G@), then 
there is a monotone mapping h(X) = Y which is a homeomorphism on G* and else- 
where agrees with m. 

Proor. Consider .a decomposition = of X consisting of the sets m ‘(y) for 
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ye«Y — G and single points elsewhere. This decomposition is upper semi- 
continuous and so there is a unique topologization of 2 compatible with the 
condition that the related transformation ¢(X) = = be continuous. We recall 
that the transformation ¢ maps each point of X onto the element of 2 which, 
as a point set in X, contains it. Since each set of the decomposition is a con- 
tinuum the mapping ¢ is monotone. Moreover the decomposition is free of 
sets which cut X and so the cactoid 2 is a single 2-sphere. 
Define a mapping from Y — G into 2 by the following formula: 


Ay) = omy), yeY —G. 


Standard results indicate that this mapping is a homeomorphism, and so 
6(Y — G) is a 2-cell on the 2-sphere 2. By a theorem of Schoenfliess the homeo- 
morphism can be extended to a homeomorphism of Y onto 2. 

Let 6(Y) = 2 represent the extended mapping. 

Now consider h(X) = 6 $(X). We notice that 

1) his a homeomorphism on G*. 


2) h(x) = 0 
= md > X — G*. 
= m(z) 


11. Approximation theorems 


The theorem just proved is fundamental and several applications are made. 
Perhaps the easiest to state is the following. 

APPROXIMATION THEOREM 11.1. Jf m(X) = Y is a monotone map from a 2- 
sphere X onto a 2-sphere Y and « > 0, then there is a homeomorphism h(X) = Y 
such that p{h(x), m(x)} < forxeX. 

Proor. First suppose that X and Y are geometrical spheres rather than being 
merely topological. A triangulation 7’ of X is a decomposition of X into spheri- 
cal triangles A; , --- , A, such that: 1) the sides of each triangle are arcs of great 
circles, 2) two distinct triangles which have a non-vacuous intersection meet 
in a single common side, or, failing this, in a single common vertex. Let | 7'| = 
max d(A,). An 7-triangulation 7 is a triangulation with the property that 
|T| <>». 

Suppose that « = 4y > 0 and 7 is an 7-triangulation of Y. If the triangles 
are A,, -:- , A, then use the notation 7',, --- , 7’, to denote their interiors. In 
each triangle select an interior point and do the same for each side. (In the last 
case the point will be interior to the side relative to the side). Suppose that the 
points in the sides are labelled y, , --- , y». Draw three ares in each A; namely, 
from the interior point of the triangle to the interior points of the three sides. 
The sphere Y is partitioned into Jordan regions R, , --- , Rm each bounded by a 
finite number of these arcs. This is the dual decomposition. Each region 
contains exactly one vertex of the triangulation. The diameter of the largest 
of these regions is 27. 

The plan of attack is immediate. We apply Theorem 10.1 to obtain a mapping 
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(X) = ¥ which agrees with m except in m™'(7;) = Tt where it is a homeo- 
morphism h,(T:) = T,. Now apply the theorem to u(X) = Y to obtain a 
mapping ho(X) = Y which agrees with h; except in hj'(T2) = T? where it is a 
homeomorphism he(T?) = T:,. Proceeding in this fashion we arrive at h,(X) = 
agrees with hn, except in = where it is a homeomorphism 

It follows that h,(X) = Y agrees with m(X) = Y on m™(F(T; + --- + 7,)) 
and h,(T;) = Tx is a homeomorphism for each k = i, --- ,n. This implies 
p{h(x), m(x)} < » 

The method of changing m with respect to the triangles of the triangulation 
can now be applied to h, working through the Jordan regions of the dual de- 
composition. This will yield a map g which agrees with h, on hy'(F(Ri +--+ + 
Ry)) and g(hz'(R;)) = R; is a homeomorphism for each j = 1, ---, m. More- 
over, p{ha(x), g(x)} < 2m and so p{g(x), m(x)} < 3m for xe X. 

The mapping g will fail to be a 5 —eeg if at all, only 7 virtue of the 
fact that it agrees with m on the set m™'(y, + --- + y,). 

For each q select a Jordan region G, such that Yq € Gy G; -G, = 0 for j#k, 
and max d(@,) < 7. Repeat the twice used process on the collection G; , -:- , G, . 
The result is a homeomorphism h(X) = Y with the property that p{h(x), m(x)} < 
4n = efor ve X. 

The case for the general 2-sphere can easily be reduced to the argument pre- 
sented above. 

Remark. A similar theorem for 2-cells is stated by Morrey [6, p. 39]. Mor- 
rey’s proof is open to objection, and since the proof above uses the characteriza- 
tion of a 2-sphere, the theorem itself is perhaps deeper than the elementary 
comments of Morrey might lead one to believe. 

As an immediate consequence we have the useful 

Corottary 11.2. Jf X and Y are oriented, then Dgr m = +1. 

Actually the theorem is not quite in the best form for the final applications 
and we now state an easily proved modification. 

THrorEM 11.3. If m(X) = Y is a monotone map from a 2-gphere X onto a 
2-sphere Y, pi, +++, Pn is any finite set of points on Y and e > 0, then there is a 

map u(X) = Y which agrees with m on m™ (pi + +++ + Pn), t8 @ homeomorphism 
elsewhere, and p{m(x), u(a)} < for xe X. 


12. The matching theorem 


The theorem of this paragraph can reasonably be described as one of the key 


theorems of the paper. It concerns clusters. 
A cactoid > is a cluster if = = S, + --- + S, where each S; is a 2-sphere. 


Suppose that the collection of cut points to be E = p++: pm. Each will be. 


common to two or more true cyclic elements. 
Suppose we have a monotone map m(X) = = from an oriented 2-sphere X 
onto =. If S is a true cyclic element of = let m(S; X) be m(X) followed by the 
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monotone retraction of X onto S, (II, 1). This product map is a monotone map 
from X onto S and if S is oriented it-has degree 1 or —1, (11.2). 

MatcuinG ToeoreM 12.1. Jf fori = 1, 2, 

1) m(X;) = = is a monotone map. 

2) X; is an oriented 2-sphere. 

3) 2 is a cluster. 

4) The 2-spheres of = are so oriented that Dgr m(S; Xi) = 1 (—1) for each 
S of 2. 

5) With the orientation on S determined by condition 4) we have Dgr m,(S; 
Xe) = 1, for each S of &. 

6) e>0. 

Then there is a homeomorphism h,.(X1) = Xe such that 

1) p{m(z'), ma(h.(2'))} < 2’ eX. 

2) h. = 1 

Note that if —1 is read in part 4 of the hypothesis, then —1 is read in part 2 
of the conclusion. This abbreviation is often used. 

Proor. Let S,,---, 5S, be the true cyclic elements and p,,---, pm the 
cut points of 2. Suppose that the minimum distance between cut points of the 
cluster is >4n = e, and, moreover, the diameter of the smallest S, is >4n. 
On each 2-sphere S; there will be at least one point of Z unless there is but a 
single S;, and then the theorem is too simple to consider in detail as a special case. 

Select a point of K on S,. There is a Jordan region on S, of diameter <n 
which contains this point. Such regions are chosen for each point of EF on S; , 
k= 1, 

The situation is now this: Each point of E belongs to several 2-spheres of > 
and in each 2-sphere of which it is an element it lies in a Jordan region. If the 
regions are D,,---,D, then the statement D;-D, + 0, 7 # k implies 
that D;-D, = p, some point of E. The sum of all the D’s containing p is a 
neighborhood of p in 2. . 

Suppose the boundary of D, is.J; (k = 1, --- , s), the boundary being taken 
relative to the 2-sphere in which D, lies. 

Consider the set 2 — UjJ;. This is an open set with m + n components. 
Each true cyclic element contains exactly one of these components, each cut 
point is contained in one of them. 

Suppose #; is the component containing p;. It consists of the sum of as 
many D’s as there are 2-spheres having p in common. Let St be the com- 
ponent contained in S,. This is bounded by as many Jordan curves as there 
are points of E contained in S; . : 

Let us now return to the mapping m;(X;) = =. For each 7, let the mapping 
mi(S; ; Xi) = S, be denoted by mx(X;) = S,. Using 11.3 we obtain a mapping 
= S, such that: 

1) wi agrees with m; on m;'(S;, E). 
2) Is a homeomorphism elsewhere. 
3) wax = Der mx. 
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4) p{uix(a'), mix(x')} > for — E). 
The set m;'(Sx — E) will be denoted by ziz . 
For each 7 define 
| ui(z') if em@(By (III, 1) 
each We now have a means of obtaining a homeomorphism from y7'(S;) onto | 
(St), in fact, define 


Then Der h = 1 (—1). 

This mapping is not defined over all of X; nor is the image all of X;. The 
telling point is that it can be extended to a homeomorphism from X, onto X:2. . 
This is the important point, the extension need fulfill no additional properties. 


2 We shall describe the extension from the set »i'(R;) onto u2'(R). With 
the no loss of generality we may suppose that R, consists of Jordan regions on r | 
the 2spheres S,,---, S;. Then F(R; = J:+ --- + J, where the J’s are Jordan i 
4 curves, one on each of the 2-spheres. 
“ For each i the region »7'(R,) will have r Jordan curves on its boundary, 

Ca,-+:-, Ci. (Suppose that the corresponding Jordan regions complementary 
se. 
<n to wi (Ri) are Ga, , Gir.) 
; Consider the mapping wu(X;) = S, for each k. The points of E on S;, are, ij 
Se let us say, Pe = Qi, °+*,Q:- On S; each of these is in a Jordan region already eA 


chosen at the beginning of the argument, one point to a region. (These regions i | 


are the D’s). 
a Alter yy, by the technique of 10.1 with respect to these D’s to obtain a homeo- | 
morphism $(X;) = S,. We need only that Dgr = Dgr uw. Define 

felz') = 
on This is a homeomorphism, and i | 
s. Der fe = Dgr dx -Dgr ou = Der Der ou | 
it = Der wx: Der = Der mx: Der mu 

= 1(-1). 

18 
"i We notice that f, agrees with h on Cx . 
# The f’s play the réle of a tool. We have 2-cells Gu, --- , Gu taken by fi, 


+f, onto Gn, ---, Ge, respectively. These 2-cells are disjoint and the 
g degree of each f, is 1(—1). By 9.3 there is a homeomorphism g:(Xi) = X2 such 

that = = 1,---, 7, and g: = 1(—1). We shall be in- 
terested in g:(X1) = Xz only on the set ui (Ri) = 41 (R,). The letters g, now 
refer only to this partial mapping. The maps h and g; agree on the only set 
where they are both defined, the collection of Jordan curves Cu, --- , Cy. ii 
this is done for each] = 1, --- , m then the composite mapping h, 91, --* , 9m 
is a homeomorphism of degree 1 (—1) which we will name h(X1) = X2. 
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monotone retraction of X onto S, (II, 1). This product map is a monotone map 
from X onto S and if S is oriented it-has degree 1 or —1, (11.2). 

Matcuine TuHeoreM 12.1. Jf fori = 1, 2, 

1) m(X;) = = is a monotone map. 

2) X; is an oriented 2-sphere. 

3) 2 is a cluster. 

4) The 2-spheres of 2 are so oriented that Dgr m(S; X1) = 1 (—1) for each 
S of 2. 

5) With the orientation on S determined by condition 4) we have Dgr m,(S; 
Xe) = 1, for each S of >. 

6) 0. 

Then there is a homeomorphism h,(X1) = Xe such that 

1) p{mi(z'), mo(h.(2'))} < ¢, a’ eX. 

Note that if —1 is read in part 4 of the hypothesis, then —1 is read in part 2 
of the conclusion. This abbreviation is often used. 

Proor. Let S,,---, 5S, be the true cyclic elements and p,, --- , Pm the 
cut points of 2. Suppose that the minimum distance between cut points of the 
cluster is >4n = e, and, moreover, the diameter of the smallest S; is >4pn. 
On each 2-sphere S;, there will be at least one point of Z unless there is but a 
single S; and then the theorem is too simple to consider in detail as a special case. 

Select a point of EF on S,. There is a Jordan region on S, of diameter <n 
which contains this point. Such regions are chosen for each point of EZ on S, , 
k= 1, +++, 

The situation is now this: Each point of E belongs to several 2-spheres of > 
and in each 2-sphere of which it is an element it lies in a Jordan region. If the 
regions are D,,---,D, then the statement D;-D, + 0, 7 #¥ k implies 
that D;-D, = p, some point of E. The sum of all the D’s containing 7p is a 
neighborhood of p in 2. 

Suppose the boundary of D, is J; (k = 1, --- , 8), the boundary being taken 
relative to the 2-sphere in which D, lies. 

Consider the set 2 — UiJ;,. This is an open set with m + n components. 
Each true cyclic element contains exactly one of these components, each cut 
point is contained in one of them. 

Suppose #; is the component containing p,. It consists of the sum of as 
many D’s as there are 2-spheres having p in common. Let S; be the com- 
ponent contained in S,. This is bounded by as many Jordan curves as there 
are points of F contained in S;. 

Let us now return to the mapping m,(X;) = &. For each i, let the mapping 
mi(S; ; Xi) = S, be denoted by mix(X,) = S,. Using 11.3 we obtain a mapping 
= Sk such that: 

1) agrees with m; on mz'(S;, E). 
2) Is a homeomorphism elsewhere. 
3) Der bik = Der Mik « 
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4) p{ua(z'), > for 2' emz'(S, — E). 
The set m;'(Sz — E) will be denoted by zx. 


For each 7 define 


) if m;'(E) (III, 1) 

We now have a means of obtaining a homeomorphism from yji'(S;) onto 
in fact, define 


h(z') for 2 eui' (Sr), k=1,---,n. 


Then Dgr h = 1 (1). 

This mapping is not defined over all of X; nor is the image all of X.. The 
telling point is that it can be extended to a homeomorphism from X, onto X2 . 
This is the important point, the extension need fulfill no additional properties. 

We shall describe the extension from the set ui'(R,) onto u2'(R,). With 
no loss of generality. we may suppose that FR, consists of Jordan regions on r 
2spheres S,,---, S;. Then F(R; = Ji+ --- + J, where the J’s are Jordan 
curves, one on each of the 2-spheres. 

For each 7 the region yz '(R;) will have r Jordan curves on its boundary, 
Ca,:-:, Ci. (Suppose that the corresponding Jordan regions complementary 
to uz (Ri) are Ga Gi, 

Consider the mapping yan(X;) = S; for each k. The points of E on &;, are, 
let us say, Pe = Gi, ***,@Q:- On S; each of these is in a Jordan region already 
chosen at the beginning of the argument, one point to a region. (These regions 
are the D’s). 

Alter un, by the technique of 10.1 with respect to these D’s to obtain a homeo- 
morphism ¢(X;) = S,. We need only that Dgr ¢% = Dgr ux. Define 

fla’) = dudu(c’). 

This is a homeomorphism, and 

Der fe = Dgr -Dgr du = Der Der ou 
= Dgr wx: Dgr = Dgr ma: Dgr mu 
= 1(-1). 
We notice that f, agrees with h on Cy. 

The f’s play the réle of a tool. We have 2-cells Gu, --- , Gu taken by fi, 
f, onto Ga, ---, Go, respectively. These 2-cells are disjoint and the 
degree of each f, is 1(—1). By 9.3 there is a homeomorphism g;(Xi) = X2 such 
that g(@u) = fx(Gu), & = 1, 7, and Dgr = 1(—1). We shall be in- 
terested in g;(X:) = Xz only on the set (Ri) = (Ri). The letters g, now 
refer only to this partial mapping. The maps h and g; agree on the only set 
where they are both defined, the collection of Jordan curves Cu, Ci. If 


this is done for each 1 = 1, --- , m then the composite mapping h, 9: , -** , Jm 
is a homeomorphism of degree 1 (—1) which we will name h(X3) = Xe. 
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If eyi'(St) then = = m(c'). Hence p{u(z'), 
poh. (2')} = 0. This is true for k = 1, 

If then = gilz') and hence and yagi (z') 
are both points in R, which is of diameter < 27. 

Hence p{yu(z'), < 2n. 

Combining these facts we have p{ ual < 2n for 

On the other hand p{u:(2'), < 9, Xi, 7 = 1, 2. 

Therefore p{m,(zx'), msh.(x')} < 4n < ¢ and the proof is complete. 


CHAPTER IV 
THE EQUIVALENCES 


In Chapter I we saw that the concept at the heart of surface theory is that of 
an equivalence. In our efforts towards solving the fundamental problem men- 
tioned in §7 we shall need several types of equivalence relations not mentioned 
in the introduction. These will be defined and their properties investigated. 


13. Fréchet equivalences 


The ordinary Fréchet equivalence has already been defined in §6. The defini- 
tion has meaning as long as the spaces are metric. In the next definition, how- 
ever, it is understood that X, and X- are oriented 2-spheres (or 2-cells). For 
convenience let us understand that the range of a mapping will be an oriented 
2-sphere unless the contrary is explicitly stated. 

A mapping f:(X1) = Y is said to be positively Fréchet equivalent to a mapping 
f2(X2) = Y if and only if for every « > 0 there is a homeomorphism h,(X1) = X2 
such that: 1) p{fi(2"), fo(h.(x'))} < every in X, and 2) Der = 1. 

A mapping f:(X1) = Y is negatively Fréchet related to a mapping fo(X2) = 
if condition 2) above is modified to read that Dgrh. = — 

For convenience we employ the notations f,F ~ fo, fiF* ~ fe, andfiF ~ fr 
respectively. 

The last of these relations is not an equivalence, but the first two are. Though 
the name of Fréchet has been attached to all of these relations it is only the first 


-for which he is directly responsible. The second is properly due to McShane 


[4] and we write f,F* ~ f, merely for terminological convenience. The third 
relation appears to have no importance by itself but has considerable bearing 
on the problem we have in mind. 
Suppose we say that fi ~ fe if iF* ~ fe or {iF ~ fe. It can be shown that. 
this is a bona fide equivalence relation and the following theorem holds. 
THEOREM 13.1. If fi ~ fe then fi ~ fe, and conversely. 


14. A counter-example 


The three relations so far introduced are to be compared with three which 
will be defined in §15 by means of the Eilenberg-Whyburn Factor Theorem 
(8.2). We digress for a moment to make a historical comment. 
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If we have two mappings fi(Xi) = Y and f(X:) = Y then there will be two 
cactoids 2, and 2, together with two monotone maps m and m, and two light 
transformations 1; and , held together by the following relations: 

= lem {(X), = = Y, =1,2. 

The mapping fi(X:1) = Y is said to be Kerékjdrié equivalent to the mapping 
f,(X:) = Y if and only if there is a homeomorphism h(2,) = 2: such that p{l:(o:), 
L(h(o1))} = 0 for every point o; of 2,. (Notation: AK ~ fr.) 

This relation could be rephrased so as to read that fiK ~ fe if and only if 4 
and , are Lebesgue equivalent. (§4). 

The notion is due to Kerékjarté [3]. Sometime later, Morrey [5] used the 
same notion without being aware of Kerékjarté’s contribution in this direction. 
In some respects it is unfortunate that the definition is recorded here at all 
as we shall merely make passing use of it. On the other hand its historical 
interest carries considerable weight and deserves mention. 

Kerékj4rté proved two theorems. The first that if fi ~ fe then fiK ~ fr. 
This theorem is true and his proof, though somewhat sketchy, certainly supplies 
the skeleton of the argument in the more general Theorem 16.5 proved later 
in this paper. Morrey does not attempt to investigate this question as it has 
no bearing, except an asthetic one, on the applications he has in mind. The 
second theorem of Kerékjarté states that if iK ~ fo then fi ~ f2. This 
theorem is also stated by Morrey in substantially the same form and used 
extensively by him in some beautiful applications to the problem of area [6]. 
This particular theorem, however, is false and a simple counter-example was 
exhibited by Youngs in 1940. 

The facts, then, are these. A theorem which is not applied is true, but one 
with extensive applications is false. It can be stated that the impetus for this 
paper came from a desire to see what could be saved in this situation. 

The characterization of Fréchet equivalence in terms of Kerékjarté equivalence 
would, if true, be a neat result apart from any applications. This cannot be 
done, but we propose to characterize it together with the other relations Ft~ 
and F” in terms of concepts akin to Kerékjarté equivalence. 

Before defining these relations let us look at a counter-example to the theorem 
of Kerékjérté and Morrey. The example appears to be in as simple a form as 
possible. 

Consider the mapping f;(X) = Y which consists of a transformation from the 
oriented 2-sphere X of radius 1 onto a pair of tangent 2-spheres. It is convenient 
to consider a great circle on X, to call it the equator and speak of the northern 
and southern hemispheres. Explicitly, the point x on X with latitude @ is 
moved directly towards the center of X and the image is at a distance sin 0 
from the center. The closed northern hemisphere is thus mapped onto one of 
the two tangent spheres arbitrarily called the upper sphere; the closed southern 
hemisphere is mapped onto the other, called the lower sphere. 

The mapping fo(X) = Y is defined as the mapping f,(X) = Y followed by a 
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reflection of the upper tangent sphere on a plane containing the center and point 
of tangency. 

Factor each mapping to get f:(X) = m(X) = 2;,1(2s) = Y,i = 1,2. 
The abstract spaces 2; and 2, will each consist of two tangent 2-spheres. 

Define Iz'h(o1), o,€ 21. 

We notice that l,(h(o:)) = h(o:) and so we need merely to check on the fact 
that h is a homeomorphism to state that fiK ~ fe. This follows at once on 
observing that both J, and J, are homeomorphisms since every inverse set is a 
single point and we are dealing with compact spaces, (8.1). 

Now suppose that f;f ~ fe. Then for each integer n there is a homeo- 
morphism h,(X) = X compatible with the requirement that p{fi(z), fallen(x))} < 
1/n, for every x in X. 

We shall speak of the northern (southern) cap of X as the spherical zone of 
points with latitudes not less than +/6N(S). In both zones the mappings 
f, and fz are homeomorphisms, thus h,(x) is uniformly close to f2'f,(x) in both 
the northern and southern caps. (In fact, an easy calculation indicates that 
h,(x) is within 2/n of fe'f(z) in these caps.) But in any event, the mapping 
fe fi(x) is a reflection on a plane through the poles of X in the northern cap 
and hence its degree is —1. On the southern cap f2'f,(x) is the identity and 
has degree 1. 

It follows by Theor 9.1 and 9.2 that if f,F ~ fe there is a homeomorphism 
h(X) = X which simultaneously has degree 1 and —1. This is impossible. 


15. The modified Kerékjarté equivalences 


The ordinary Kerékjarté equivalence fails, as the counter-example indicates, 
due to the fact that it ignores the matter of orientation and degree of a mapping. 
In view of this the following natural modifications are proposed. 

As in §14 the mappings f(X:) = Y are factored, giving | 

f(Xi) = 

m,(X;) = 2;, a monotone map from X on to the cactoid >; 

1(2:) = Y, alight mapping from 2; onto Y. 
_ Suppose that the general true cyclic element of 2; is S; and consider the map- 
ping m,(S;; X:) = S;. It is understood that X, and X. are oriented and so 
by 11.2 it is possible to select an orientation on S; such that the degree of this 
monotone mapping will be 1. | 

Suppose further that there is a homeomorphism h(2,) = 22. If S; and S&: 
are corresponding 2-spheres then h is said to have degree 1 (—1) on 8S, if the de- 
gree of the partial mapping h(S,) = S, is 1 (—1). The homeomorphism h is 
said to have degree 1 (—1) on 2; if Dgr h(S;) is 1 (—1) on each true cyclic 
element of 2; . 

The mapping fi(Xi) = Y is said to be positively Kerékjdrté equivalent to the 
mapping fo(X2) = Y if and only if there is a homeomorphism h(2,) = 2 such 
that 1) = 0 for o; 2; and 2) Dgrh = 1(—1) on. (Nota- 
tion: ~ fr). 


ij 
r 
| 
pu 
1) 
fil 
li 
| to 
fil 
i 
had! 
it ol 
| 
| 
ni 
W 
3 
0 
hid 


THEORY OF FRECHET SURFACES 767 


‘The mapping fi(Xi) = Y is said to be negatively Kerékjdrté related to the map- 

ing fo(X2) = Y if and only if there is a homeomorphism h(2,) = such that 

= 0 for and 2) Dgr h = —1 on (Notation: 
fr). 

Finally the mapping f;(Xi:) = Y is said to be strongly Kerékjdrté equivalent 
to the mapping fo(X2) = Y if and only iffiK* frorfiK” fe. (Notation: 
fK* ~ fr). 

There is no trouble in seeing that the first relation is an equivalence, indeed, 
the following statement is almost as easy to verify. 

TororEM 15.1. The relation K* ~ is an equivalence. 


16. The characterization theorems 


We have already stated that these modified Kerékjdrté equivalences fill the 
gap left by Kerékj4rté and Morrey. The facts are these. 

16.1. If fiK* ~ fe then ~ fo, and conversely. 

16.2. Jf fiK then fiF and conversely. 

THrorEM 16.3. Jf fiK* ~ fe then fiF ~ fe, and conversely. 

The last theorem is a logical consequence of the first two and so we shall 
concentrate on these. As a matter of fact we can carry the proofs simultane- 
ously, as they differ only in regard to comments on the degree. 

It will be useful to deal with an equivalent form of the Kerékjarté relations. 

Suppose that the 2-spheres of 2; are oriented in the standard way so that 
Dgr mi('S; ; X) = 1 for each S; of 2;, 7 = 1, 2. 

IffikK’~fe(fiK fr) we shall be interested in the factorization f,\(X1) = 
bhm,(X). Now hm,(X) = mj (X) is a monotone map from X; onto 2» and since 
Dgr h = 1 (—1), Dgr mi(S:; X:) = 1 (—1) for each S: of 2. Using the 
notation 1 for 1, and = for , we have f;(X:) = lmi(X) and fo(X2) = lm,(X2), 
where Dgr m:(S; Xi) = 1 (—1) for each S of =. Now let us use the notation 
m, for ‘ 


Hence fi(X1) = lm (X), m(X)=2, 
fo(X2) =Im,(X), m(X)=2%, U2) =Y, (IV, 1) 
and Der m(S;X)=1(-1),while 


Dgr m (S; X) = 1 for every S of =. 

Conversely, if two mappings f:(X1) and fo(X2) satisfy the conditions of (IV, 1) 
then fiK* ~ fo fa). 

We are now in a position to state 

Lemma 16.4. If two mappings f,(X1) and fo(X2) satisfy the conditions of (IV, 1) 
and « > 0, then there are mappings fi (X1) and fa (X2) such that 

1) fi(X) = where = is monotone, and 
G(2*) = Y is light; i = 1, 2. 

2) Dgr mr(S*; Xi) = 1 (—1) for each S* of =*. (IV, 2) 

3) Der mi(S*; X) = 1 for each S* of =*. 

4) pf fila’), = 1,2. 

5) 2* is a cluster. 
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Proor. In obtaining the new, or refined mappings it will be necessary to use 
the fact that all the maps are not only continuous but uniformly so, a fact which 
follows from the compactness of the spaces under consideration. Hence we can 
assert that if 4n = « > 0 there is a 6 such that p{oi, 02} < 6 implies p{I(«,), 
< 7. 

The theorem is trivial if 2 is degenerate. 

We now turn to. the fundamental structure theorem (8.6) for Peano spaces 
and make our first application of uniform continuity. Relative to the space > 
there is a sequence of cyclic chains C(p,, gn) such that 

1) UP C(pi, gi) = An is an A-set for every n. 

2) An-C(pnit, Un+i) = for every n. 

3) UP A, = H is dense in >. 

4) If d, is the supremum of the diameters of the components of 2 — A,, 
then d, — 0. 

First MopiricaTion. It is possible to select an n such that d, < 6. Hence 
each component of 2 — A, has a diameter <6, and this guarantees that the 
image under | of each of these components has a diameter <7. For convenience 
we shall employ the notation A for A, . 

The mapping m,(A; X;), that is m;(X;) followed by the retraction of 2 onto 
A yields g;(X:) = lm,;(A; X:) where the map / is now restricted to A. We 
assert that 


SECOND MODIFICATION. In general there will be an infinite number of 2- 
spheres in A, on the other hand by 8.4 we know that only a finite number of 
these can have diameters = 6. Delete from the 2-spheres of A every one which 
contains a point of the set (p1, , Pn, 41, °** Qn), and those whose diameters 
are = 6. Then the infinite collection @ of 2-spheres of A remaining has the fol- 
lowing fortunate properties. 

1) Each 2-sphere of @ belongs to C(p;, q:) for exactly one index 7. 

2) Each 2-sphere of @ contains exactly two of the cut points of A. 

Now examine the general 2-sphere S of the collection @. Since it has two cut 
points uniquely determined on it, we can map it homeomorphically onto a 
geometric sphere so that one of the cut points is taken into the north pole, the 
other into the south pole. This enables us to speak of the lines of latitude and 
the meridians on S in the obvious way. A mapping of S onto a meridian of S 
is now to be defined. A definite meridian is chosen. A general point on S lieson 
some latitude. This latitude cuts the meridian at precisely one point. It is 
this point of intersection which is designated as the image of the general point 
on S. The mapping is clearly continuous, and, in fact, monotone. 

If we apply this mapping to each 2-sphere of A in the infinite collection 
the set A is mapped onto a subset A* which is also an A-set relative to itself. 
Designate this mapping by ¢(A) = A*, the mapping ¢m;(A; X,;) = A* by 
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y(X;) = A*. Both of these mappings are monotone. Define k,(X;) = lu;(X,) 
where / is of course restricted to the set A*. We assert that 


pl{ki(x’), <n, *=1,2......... (IV, 4) 


We remark that Dgr u:(S; X:) = 1 (—1) and Der ua(S; X2) = 1 for each S 
of A*. 

FINAL MODIFICATION. Let pe £ if p is a cut point on a 2-sphere of A* or if 
pis a point of the collection (p,,---, Pn, %,°*:,@n). The set E is finite. 
Consider the components of A -- HE. They are finite in number and will be of 
two types: there will be 2-spheres punctured at a finite number of points; there 
will be open ares. For a moment we will be interested in the ares. 

Each open arc has the property that the removal of a certain finite number of 
its points breaks it into a finite number of components each of diameter <6. 
It is possible to perform this operation in each of the open arcs so as to obtain a 
finite set E* with the property that: 1) E* > E; 2) the components of A* — E* 
consist of precisely those mutilated 2-spheres which are components of A — E 
and a finite number of open arcs a each of diameter <6; 3) the end points of a 
are mapped into distinct points by I. 

The mapping yu;(X;) = A* defined in the second modification is monotone 
and so each of the inverse sets 7 (a), a « A* is connected, i = 1, 2. Consider 
the upper semi-continuous collection =* consisting of the sets uz*(a) for a in no 
aand single points elsewhere. This collection is topologized to yield a cactoid 
which will be denoted by =* and the related map m:(X;) = >* is defined as 
usual. This map is of course monotone. 

About this construction we make several remarks which can be proved by 
standard methods. 

First, the space 2=* is a cluster. 

Second, the space A* can be imbedded in 2*. Hence it is possible for us to 
consider A* C 

If we think of A* C >* then some of the 2-spheres of =* are already oriented 
by virtue of the fact that they are also 2-spheres of A*. For these 2-spheres 
S* we know that Dgr mj (S*; X) = 1(—1). Orient the other 2-spheres so that 
the above requirement as to degree | is uniform. 

We now wish to define a mapping m2 (X2) = =*. If S* isa true cyclic element 
n A* then mz, is defined to be 42. Consider a 2-sphere S* of 2* which is not a 
2sphere of A*. Then S* has exactly two cut points a and b and a unique arc 
a of A* joins them in S*. There is a mapping of degree 1 from the set ua (a) 
onto S* which agrees with yw: on u2'(a) and yo (b). Do this for each S* of =* 
not in A*. We now obtain a composite mapping m2(X2) = =* such that 
Der m(S*; X2) = 1 for each S* of =*. It is of course necessary to show that 
this composite mapping is continuous but this is not difficult. 

We pick up the thread of argument in connection with the mapping /(A*) C Y 
which has meaning only on a subset of =*. Ignore J except on the 2-spheres of 
A* C >*, and the points of Z*. On this set define [* to be 1. Consider one of 
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the arcs a. This lies in some S* of =* not in A*. If @ has end points a and } 
then /*(a) and [*(b) have been defined. Define /*(S*) to be a topological ex- 
tension onto the 2-sphere in EF with I*(a) and /*(b) as poles. This construction 
will work since I*(a) # I*(b) by our choice of E*. Proceeding in this fashion 
we obtain a light mapping /*(2*) = Y*. 

(Actually the lightness of this map is important only for esthetic reasons.) 

Now consider m}(X;) = and /*(2*) = Y*, = 1, 2. 

Define f; (Xi) = I*m7(X,), i = 1,2. We assert that 


pife(z’), k(a’)} <2n, a *=1,2........ (IV, 5) 


If x* ¢yz'(S) where S is a true cyclic element of A*, or if x € uz'(E*) then the 
distance above is zero. If x’ eu; (a) where the end points of a are a and b 
then /* and 1 agree on a. On the other hand fi (2*) is within 7 of I*(a) and f,(z') 
within 7 of l(a). 

We can now state that 


plfi(a'), fila’) <<4n =e for X;, 1,2, 


since this result follows from (IV, 3, 4 and 5). 

Moreover, Dgr mi (S*; X,;) = 1 (-—1), and m,(S*; X2) = 1 for each S* of >*. 
Hence, in view of the fact that =* is a cluster we have obtained the mappings 
fi (X;), i = 1, 2 held up as our objective in the lemma. 

We make an immediate application of this lemma together with the matching 
Theorem 12.1 to prove a portion of the Theorems 16.1 and 16.2. There is a ¢ 
such that if , 02} < ¢, then p{l*(o1), < 

By the matching theorem there is a homeomorphism hj(X,) = Xe of degree 
1 (—1) such that (2'), m2 (hs(2"))} < ¢ for every Hence p{fi(z’), 
fe (hs(x'))} < , And now by Lemma 16.4 we have p{fi(2"), fo(hs(z'))} 
< 4n + 4n + = 9n for €2,. This shows that 


~h 
Ak fofF hf 
~ fe 


The last line is a logical consequence of the first two. 

The converse of these propositions is, in some respects, simpler, and a portion 
of the converse is true in great generality. Since there is no added difficulty in 
proving the more general theorems we make this our goal. 

We note that for each 7, if X; is a general Peano space and f,(X;) = Y isa 
mapping onto the Peano space Y, then, though the hyperspace. 2; associated 
with the monotone factor m; of f; will not necessarily be a cactoid it is obvious 
what is meant by the notations f, F ~ feandf; K ~ fe. There is of course 
no question of orientation and so degree is meaningless. 

It will be our purpose to show that fi F ~ foimpliesf, K ~ fe. 

We need to show that there is a homeomorphism between 2, and 2: satisfying 
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a certain condition relative to the light factors J, and i of fi and fe. Since the 
spaces are compact it is sufficient to show that there is a one to one continuous 
transformation having this added property. (8.1). 

TeorEM 16.5. If fi(X1) = Y and fo(X2) = Y are mappings from Peano spaces 
X, and X2 onto Y, and for every « > 0 there is a continuous transformation g.(X,) = 
X, such that p{fi(x"), fa(ge(x'))} < Xi, then there is a continuous transfor- 
mation y(2i) = 2 between the associated hyperspaces such that p{l(o:), le(y(o1)) = 
0, 

Proor. Since X; is a Peano space there is a dense set of points b; , be , bs , +: - 
inX,. Suppose that if « = 1/n we employ the notation g,(X;) for the associated 
transformation. 

The space Xz is compact and so there is a convergent subsequence of {g,(b)}. 
Denote this sequence by {gin(b:)}. For precisely the same reason there is a con- 
vergent subsequence of {gin(be)}. Denote it by {gen(be)}. Proceeding in this 
fashion we obtain for each k a convergent subsequence {gin(b,)} of the sequence 
gr-in(be-1)}. We notice that {gin(b;)} is convergent for any j < k since a subse- 
quence of a convergent sequence is convergent. Finally, the sequence {gn»(b,)} 
is convergent for each k. 

The mapping gnn(X1) = Xe occupies a position in the sequence {g,(X,)} not 
before the nth term, hence 


The double subscript is discarded and the notation g,(X,) is used. For any k 
the sequence {g,(b.)} is convergent. 

Select any point o in 2, and consider the continuum njz'(o’). By the defini- 
tion of 2, it follows that f,(mz'(o’)) is a single point y of Y. For any number 
> 0 let U(n, y) denote the set of points in Y whose distance from y is less 
than ». This is an open set. The fact that fi is continuous guarantees that for 
any positive integer the set f7'(U(1/», y) is an open set containing mj’ (o’). 
Let V,(mz'(o')) denote the component of f;'(U(1/», y)) which contains the con- 
tinuum mz"(o'). It is convenient to shorten the notation to V, if no ambiguity 
can arise. That V, is open follows from the fact that X is Peanian. There is 
some k such that b; ¢ V, and so by is in the continuum V,. Hence g,(V,) = 
C,, contains the convergent sequence of points g,(b,). By the Janisewski theo- 
rem, lim sup gn(V,) is a continuum C,. 


Select any x” in g,(V,). Then there is an 2’ such that 2’ ¢ V, and g,(z') = 2°. 
It follows that 


plfa(x"), y} S pffe(gn(a’), filai)} + y} S 1/n + 
Therefore 
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If x’ « C, then for each integer n there is a point 2, ¢C,, and a subsequence 
2°. Choose the point in V, in such a fashion that g,(z,) = Then 


plfa(x’), y} S folans) + y} 
S p{fo(x*), fola'ns) + + 
The expression on the right converges to 1/y as j goes to infinity. Hence 


Finally, consider the sequence {V,} and note that V, D V,4; hence C,, = 
Gn(V») D gn(Vr41) = Cry. This implies that C, = lim sup C,, > lim sup 


Cnvi1 = Cy41. Hence we obtain the continuum C = NM? C,. In virtue of (IV, 
8) above it follows that fo(C) C U(1/», y) for each ». Hence 


This means that C C fz'(y) and as C is a continuum there is an inverse set m2’ (a’) 
containing C. Define 


As an immediate remark we note that l(c) = h(y(0c')) = y hence we need 
only to check the statement that y(21) = 22 and the continuity of y to prove the 
theorem. 

First we prove that y(21) = 22. 

Select any o’, and let y = fo(o’). In exactly the same manner as V, was se- 
lected relative to mz'(o') choose the region W, containing mz'(c°). We recall 
that fo(W,) C U(i/,, y). 

Now consider g, (W,) = E,,. This set is not connected, in general, but it is 
closed since g is continuous. 

The space X, is compact and so the set Z, = lim sup E,, is not empty. More- 


over E, is closed. 

If then pl filr'), y} folgn(x'))} > p{fo(gn(z')), y} 1/n + 
1/v. Hence fi(En») C U(1/n + 1/», y). Precisely as in (IV, 8) we obtain fi(Z,) 
Cc U(1/», y). Continuing to follow the pattern we see that E,,, D En,y4: and 
E, = limsup £,, > lim sup Ey,41 = E,41. The closed set ZH = / EZ, is such that 


fi(E) C U(1/», y) for each v. Hence f,\(Z) = y. 
This means that there is a point o' and a component K of E such that mj" 
(c') > K. We assert that 
y(c’) = 
This will of course suffice to show that y(21) = 22. 
Suppose that y(c’) = ¢ ¥ o. Then there are two disjoint regions Ri and Re 


containing o” and ¢ respectively. The inverse sets m2'(R;) and m3'(Rz) will 


also be disjoint regions containing m2"(o") and m2'(¢) respectively. Name 
these regions U(o’) and U(¢). 
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The notation V,(m;z'(c')) is already familiar, We note that V,> K. As 
before we obtain the continua C,,, C, and C. By (IV, 10) C C m;*(b), since 
y(¢) =. Hence there is an index » such that C, C U(é). Hence there is an 
integer % With the property that ifn > mm 


On the other hand because of the compactness of X2 there is an index \ such 
that W, U(o’). Since K is a component of E, the set E 
Hence theye is a sequence of integers ki < k, < ks < --- such that E, a= 
ii, (Ws) intersects V, for every j. But.then Wy-g:,(V,) ¥ 0 which implies ‘that 

U(o’)-gx;(V.) # 0 for every j. This is contradictory to (IV, 11). 

This shows that the mapping 7 is from 2; onto 2, . 

We need to check the continuity of y. 

If y is not continuous then we may suppose the existence of a sequence a; — 
such that y(ot) > ¢ ¥ = 

Let U(¢) and U(o’) have the same meaning as above. We recall that the 
regions are disjoint. 

As in the case of (IV, 11) we obtain 


On the other hand (ok) = o;, — ¢ and so there is an integer k such that both 
U ©) and V,(mz"(0')). By (IV, 10) we know that the con- 
tinuum m3" the continuum lim Hence for 


all but a finite number of n we have g,(V, (mz(o2))) Cc U(g). Hence g,(mz"(oi)) 
C U(¢) for all but a finite number of n. And now, since mz'(o4) C V,(my'(0')) 
by our choice of k, we see that g,(V,). U(¢) ¥ 0 in flat contradiction to (IV, 12). 

This completes the proof of the final part of the theorem. 

Using the terminology of 16.5 we have the following 

16.6. If = y(o') then mz'(o°) D lim ‘sup 


Proor. As in 16.5 we have V,(mz'(o')), from Cav = gn(Vr) D 
(o')). Therefore C, = lim sup g,(V,) > lim sup gn(mz'(o')) for each v. (The 


set on the right is probably not a continuum). But m2"(o°) D C > lim sup 
gn(mi'(c")), as can be seen from (IV, 10). 

Coronary 16.7. If each of the mappings g,(X1) = Xz is a homeomorphism, 
then y(21) = also a homeomorphism. 

Proor. Since 2; is compact it is sufficient to show that 7 is one to one, (8. 1). 


Suppose that there are three points oj , , and such that = = y(c2). 


If (c°) = ylet W (ma "(o")) be defined as in 16.5. There is an m such that if 
n> then g,(mz"(o1)) and g,(m;'(02)) liein W,. Thus the set = gn(W,) 
> mz'(ci) + miz'(o2) and is a continuum since In. is a homeomorphism. Hence 
by the Janisewski theorem E, = lim sup Ew D> + (02) and is a contin- 
wm. Finally E = NE, D + mz'(o2) and is a continuum. 
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On the other hand, a duplication of the argument used in showing that the 
mapping y is from 2; onto 22 shows that fi(Z) = v. Hence E = mj‘(c}) = 
my (02). 

We have thus proved 

THEorREM 16.8. Jffi F ~fe,thenfi K~fr. 

This is the theorem which Kerékjarté stated for 2-spheres. 

With the notation of 16.5 and 16.6 we have 

THEOREM 16.9. The sequence {m2(gn(x'))} converges uniformly to ym,(z'). 

Proor. Suppose G is any region on 2,. Let i(@) = K, then by the defini- 
tion of y we have lay(G) = K. 

Suppose 2” elim sup g,mi'(G) then there is for each n a point 2°, € g,m;'(@) 
and a subsequence 2,, 2°. Select a point x, mi such that gn(xn) = 
We may suppose that — a. Hence p{ < 1/n; and f,(z') = 
fo(x”). But f(z’) = = K. Therefore fe(x”) eK. Hence 


fo(lim sup gamy'(@)) C (IV, 13) 


On the other hand G is open and so m;'(G) contains one of the everywhere 
dense set of points b; , be, bs, --- of 16.5. Hence lim sup g,mj'(G) is a continuum 


and must lie in some component K° of fz'(K). 

Suppose o’ « G@ and o” = y(o'), then by Corollary 16.6 we know that lim sup 
gum, Therefore, lim sup gnmi'(@). mz '(y(G)) 0. Since 
m2 is a continuum, lim sup and mz"(7(G)) lie in the same compo- 


nent K’ of fz'(K). 

Now consider the component C’ of Iz'(K) which contains y(@). Such a com- 
' ponent exists since y(G) is a continuum and i,(y(G)) = K. We notice that 
m:'(C”) = K*. If the diameter of C” is less than some number « then there is 
also a region G’ containing C” and of diameter <e. Now lim sup g,m;'(@) C 


lim sup K*® = Hence there is an integer 
such that if n > 


This is the key step in the argument. 

The map /,(22) is light and so by 8.3 there is a 6 such that if K is a continuum 
of diameter < 6 each component of Iz'(K) is of diameter < «. 

The space 2; is the sum of a finite number of regions G,, --- , G, each of 
diameter < ¢ and each having an image under |; of diameter < 6. If 4(G;) = 
K; then d(K;) < 6. Hence each component of I;'(K;) has diameter < «. That 
component Cj containing y(G;) is contained in a region Gi of diameter < «. 
Since there are only a finite number of sets in the collection G, , --- , Gn there is 
an m such that if n > m 


gnmy (G;) fori = 1,---,k. 
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Select any There is a G; such that The set G? contains 
and so-y(m,(z')). If nm > no then ga(z") € C m3"(G?). Therefore 


Magn(x') € 


Hence p{magn(x"), ym(z')} < d(Gi) < and the theorem is proved. 

Suppose that S is a 2-sphere of 2, and y(S') = S*. For convenience let 
m(S', X:) = m(X1) = S' and m,(S’, X:) = = 

Corottary 16.10. The sequence {egn(x')} converges uniformly to yu;(z'). 

Proor. For any « = 3y > 0 there are at most a finite number of components 

t, Ks of — S* which have diameters Consider K? and a? = 
F(K?), the frontier of Kj. There is a minimum distance 3¢ between points 
of the set ai, ---,a@,. On S’ select a connected neighborhood R? of a} (open 
with respect to S*) with diameter less than ¢ and also less than 7. Add to R} 
all the components of 2, — S° with frontier points in Rj. By a standard 
theorem of the cyclic element theory this new set G} is a connected open set 
containing R}. No point of Gj is at a distance greater than 2y from some 
point of K;. These open sets are disjoint. 


Let y "(K3) = K} and m;'(K}) = K;. By theorem 16.9 we know that 


mag,(') = ym,(z") and hence there is an m such that if n > no then mg,(K;) C 
G. Suppose no is larger than any of the integers mn, --- , n, and in addition 
has the property that p{mg,(2'), ymi(z')} < nforn > and ¢«X;. 

Now if n > mo then we shall show that p{uegn(x'), yui(z")} < 3n. 

If K; then = a; and Hence the above distance is 
less than n. If 2 ¢UK; then pf{ym(z'), ym(z')} < and p{ug,(z’), 
magn(x*)} > hence p{ysgn(2"), < 2n + p{megn(x'), ymi(z')} < 39 = «. 

This proves the corollary. 

Now suppose that X, and X- are oriented 2-spheres and each true cyclic 
element S' of 2, and S* of 2 is so oriented that Dgr m(S'; X:) = 1 
and Dgr m(S°; = 1. 

In view of 10.6 we have at once 

Corotuary 16.11. For large enough n the degree of the partial mapping y(S') 
= §° for each S' of 2, is the same as the degree of gn(X1) = Xe. 

TuzorEM 16.12. If F* ~ fe thenf, ~ f; if fi F then fy 

Proor. The result follows by virtue of 11.2, 16.5 and 16.11. 

It is now possible to tabulate the results of this chapter in the following 


diagram 


fiF* ~ fe ~ fe 
fF if ofK fe 
fF ~hofk* ~h 
fF ~fh-fK ~h 
fF ~h 
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CHAPTER V 


TOPOLOGICAL APPLICATIONS 


The results of Chapter IV are now applied to the fundamental problem men- 
tioned in §7, and to other questions concerning representation. 


17. Representations 


It has already been mentioned that if the equivalence relation used is Fréchet 
equivalence, then the equivalence classes obtained will be called Fréchet surfaces. 
In the event we use positive Fréchet equivalence the resulting classes will be 
called oriented Fréchet surfaces. The notation for the first class will be [f] 
whereas for the second we will use the symbol [f]*. 

Suppose that [f]* is an oriented Fréchet surface with a single known represen- 
tation f(X) = Y. The factorization for f is lm(X), m(X) = 2%, l(2) = Y. 
Suppose that X is oriented, then this induces an orientation on each 2-sphere 
of 2; that is, the orientation such that Dgr m(S; X) = 1 for each 2-sphere S of 2. 
Suppose that we have a mapping m,(X,1) = 2 from an oriented 2-sphere X; onto 
> having the following properties: 1) m; is monotone, and 2) Dgr m,(S; Xi) = 1 
(—1) for each 2-sphere S of 2. If m has these properties we say that it is mono- 
tone and positively (negatively) consistent with m. A mapping m is said to be 
monotone and consistent with m if it is monotone and either positively or nega- 
tively consistent with m. 

Consider the mapping fi(X1) = lm(X). By Theorem 16.1 (16.2) we know 
that if m, is monotone and positively (negatively) consistent with m then 
fu(Xi)F* ~ f(X) ~ f(X)). Moreover, if fi(Xi)F* ~ f(X) ~ 
f(X)) then there is a monotone mapping m,(X,) = > which is positively (nega- 
tively) consistent with m such that f,(X1) = lm(X). In view of these remarks 
we have the 

FUNDAMENTAL STRUCTURE THEOREM 17.1. If f(X) = lm(X), m(X) = &, isa 
representation of an oriented Fréchet surface [f]* (Fréchet surface [f]), then the 
totality of representations of [f]*([f]) is the totality of mappings of the form 
fi(X1) = lm(X1) where m(X1) = 2 is any monotone map positively consistent 
(consistent) with m. 

We do not plan to make any direct topological applications of this theorem 
in this paper, but it is clear that if we are to obtain representations which enjoy 
certain desirable properties then the mode of attack is to alter the monotone 
factor of the given representation accordingly. 

A rather useful theorem in many applications is the weaker 

THEOREM 17.2. If f(X) = lm(X), m(X) = &, is a representation of an oriented 
Fréchet surface [f|* (Fréchet surface [f]), and u(X) = X is a monotone mapping of 
degree 1 (is any monotone mapping) then f(u(X)) is also a representation of the 
oriented Fréchet surface [f|* (Fréchet surface []). 
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CuapTer VI 


AREA 


The applications in this chapter are concerned with the problem of area both 
for oriented and ordinary Fréchet surfaces. We state the case for oriented 
surfaces; there will be no difficulty in making the usual translation to the case 
of Fréchet surfaces. 

The repeated insertion of the adjective oriented becomes laborious and so we 
drop it on the explicit understanding that all statements are made for oriented 
surfaces. In conformity with this understanding we write [f] instead of [f]* 
as heretofore. 

It should also be noted that the general outline of the application is due to 
Morrey. In spite of the fact that Morrey based his whole argument on a false 
structure theorem, his final theorems are, in the main, correctly stated. 


18. Preliminary concepts 

Without attempting an exposition of the subject—here we can consult Radé 
[7|the situation for oriented Fréchet surfaces is roughly this. The collection 
of all such surfaces is topologized and in the resulting space there is selected a 
favored class of surfaces called oriented polyhedra. ‘Topologically this favored 
class is important because it is dense; analytically it is important because each 
polyhedron [p] can be assigned a non-negative number E[p|—known as the 
elementary area in a perfectly natural manner. Moreover, E[p] is a lower 
semi-continuous function on the class of polyhedra. Explicitly, if [p,] — [p] 
in the topology and all surfaces are polyhedra, then lim inf E[p,] 2 E[p]. 

The Lebesgue area of a surface [f] is how defined by extending the function 
E{p|. Since the polyhedra are dense in the space there is at least one sequence 
[pn] — [f]. With this particular sequence there is associated a non-negative 
number defined to be lim inf E[p,]. (For convenience infinity is spoken of as a 
number.) The infimum of the numbers lim inf E[p,] is defined to be the Le- 


besgue area of [f]. 
L{f] = inf lim inf Efp,]................ (VI, D 


n 

The topologization of the collection of surfaces is afforded by the oriented 
Fréchet metric defined by McShane [4]. An equivalent topologization is the 
following [f,] — [f] if and only if there is a representation f(X) of [f] and for 
each n a representation f,(X) of [f] such that f,(x) = f(x), z«X. (The notation 
stands for uniform convergence.) 

The literature contains several methods of selecting the class of polyhedra, 
the definition depending on the manner in which the class is to be employed. 
The different methods all yield the same number for the Lebesgue area. 

For the purpose of this discussion a polyhedron is a surface [p] which has a 
representation of a particular kind now to be defined. 
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To begin with the range is understood to be a geometrical sphere in E;. The 
concept of a triangulation and 7-triangulation has already been introduced, 
(11.1). A transformation p(X) = Y is said to be a polyhedral mapping if there 
is a triangulation T of the oriented 2-sphere X such that for each A of T the 
mapping p(x), ze A is: 1) a homeomorphism onto a rectilinear triangle p(A) 
in E; ; or 2) a monotone map onto a segment of a straight line in &; ; or 3) a 
(monotone) map onto a single point in Z;. The triangulation T’ is said to be 
associated with the polyhedral mapping p(X) = Y. 

A surface [p] is an (oriented) polyhedron if it has a representation p(X) = Y 
which is polyhedral. 

Consider a polyhedral mapping p(X) = Y and associated triangulation 7. 
If Ae 7 then p(A) has an area E(p(A)) in the ordinary sense of the word, an 
area which may, of course, be zero. Let E(p) = ZE(p(A)) where the summa- 
tion is taken over all A’sin T’. This is called the elementary area of the mapping 
p(X) = Y. The important thing is that if a polyhedron p has two polyhedral 
representations p:(X1) and po(Xe), then E(p:) = E(p2). Hence this common 
value may be designated by E[p]| and spoken of as the elementary area of the 
polyhedron [p]. 

A triangulation 7* is said to be a refinement of the triangulation T' if for each 
A* of T* there is a A of TJ’ such that A* C A. We notice that if p(X) is a poly- 
hedral mapping with associated triangulation 7’, and 7* is a refinement of T, 
then it is not true in general that p(X) is polyhedral with respect to the triangu- 
lation T*. 

We need a further remark on the definition of area. It is clear from the 
definition that for any surface [f] there exists a sequence of polyhedra {[p,]} 
such that [p,] — [f] and also E[p,] — L[f]. The first limit means that there is a 
representation f(X) of [f] and a representation p,(X) of [p,] such that p,(x) = 
f(x), «eX. Unfortunately there is no a priori guarantee that p,(x) will be a 
polyhedral mapping. It is known, however, that given any representation 
f(X) of a surface [f] from a geometrical sphere X in E;, there is a sequence of 
polyhedral mappings P,,(X) and associated triangulations 7’, such that 


1) pale) f(z), eX. 
2) Elp,| L{fl. 

3) > 0 

4) 8(T'x3 px) = max d(p,(A)) — 0) 


(For a discussion of the comparable case on a 2-cell, see Youngs [12]). 


19. The stretching process 
Suppose p(X) = Y is a polyhedral mapping with an associated triangulation 
T consisting of N triangles. Associated with the triangle A on the sphere X 
is the rectilinear triangle A* whose vertices are the same as the vertices of the 
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triangle A. Consider a rectilinear coordinate system on A* and (£, », ¢) system 
in Ey , with origin yo. There is a unique map £ = £(u, v), 7 = (u,v), ¢ = ¢(u,v), 
linear in both u and v—expressible in vector notation as ¥ = X(u, v)—defined by 
the requirement that X(u, v) agree with p(x) on the vertices of A*. These 
vertices are, of course, the only points of A* at which p(x) is defined. 

For R > r > O consider the vector mapping 


due to Morrey [6], Rad6é and Reichelderfer [9]. 

For any integer n consider a triangulation 7". of A* with the important prop- 
erty that the vertices of 7’. divide each side of A* into exactly n equal parts. 
Suppose are the triangles of T* 

There is a unique linear map X3(u, v) from at compatible with the requirement 
that X7(u, v) agree with 9)(u, v) at the vertices of 6; . The mapping ¥*(u, v) 
defined to be X;(u, v) if (u, v) € 87 is known as quasi-linear, [McShane 4]. The 
ordinary area of the image triangle ¥}(6;) is denoted by E(%;(8;)), and the sum 
of these numbers, for i = 1, --- , k, is denoted by E(¥*(A*)). The mapping 
Y)(u, v) is smooth enough to argue by classical results that for any « > 0 there 
is an integer n(e, A*) such that if n > n(e, A*) then there is a triangulation 7% 
of the above type with the property that, for the associated quasi-linear mapping 
X*(u, v) we have 


E(X*(A*)) < + «, 


where the symbol J(%*) represents the number obtained by an application of the 
standard integral formula for the area. 

A highly interesting result of Morrey [6], Radé and Reichelderfer (9] enables 
us to assert that 


and so 


E(Q*(A)) < + 


For each A* of T there is an associated n(e, A*). Select an integer n larger 
than the maximum of these numbers and consider a triangulation 7, for each 
A* of T. The projection of these triangulations onto X from the center of X 
yields a refinement 7’, of the triangulation 7’. (It is understood that the original 
T is fine enough to allow this statement.) 

Define a modified mapping p*(x) on this wise. The point zx is the projection 
of a point (u, v) on some 8; of a triangle A*. Let p*(x) = X; (u,v). The special 
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choice of triangulations T°, relative to the sides of A* guarantees that p*(z) is 
continuous over X, in fact, the mapping is polyhedral. Moreover, 
R 
1) E(p*(X)) < p= + Ne 
2) p{p*(x), p(x)} < 6(T;p) + R, re X 
3) If a triangle A of T is such that 


4 


p(A) has every point at a distance less than r from yo, then p*(A) = y. 
We shall speak of having obtained p*(x) from p(z) relative to yo, R, r and «. 


20. The additivity of the area 


The applications of the stretching process are concerned with a situation in 
which we have a surface [f] with representation f(X) = lm(X) = Y, m(X) = = 
and has a cut point c. Then 2 = I; + I where both and I; are open and 
= 0. In fact, placing = 2, and = both 2; and are cactoids 
and have the single point c in common. 

For each 7, let 1; be the partial mapping /(2;) C Y so that we can write 1;(2;) = 
Y. Let fi(X) = X). 

For each true cyclic element S of 2 choose the orientation so that 
Der m(S; X) = 1. Let X; and X2 be two oriented geometric spheres in E; the 
point of tangency being d. By a standard argument there is a monotone map- 
ping m;(X;) = 2; such that m;(S; X;) has degree 1 for each 2-sphere S in 2;, 
a= 1,2. (This is, but for degree, Theorem 8.8.) 

Now consider the sphere X. There is a monotone map p(X) = X; + Xe which 
is one to one except that y~ (d) is a great circle which we shall call the equator 
Q of X. Moreover u(X;; X) = X; has degree 1 for each 7. Suppose that 
'(X,) = A; Then A: Ag = Q. 


{mu(z), xveAy 
{mou(x), 


is of some importance. Not only is m*(X) = = but m*(S; X) has degree 1 for 
each 2-sphere S of 2. By the fundamental structure Theorem 17.1 the mapping 
f*(X) = Ilm*(X) is also a representation of [f], and so L[f] = L{f*]. 

Let mi(X) = mia(X; ; X), and f7(X) = la?(X), i = 1, 2. We now have 
= [fi]. 

THEorEM 20.1. L{f] = L[fi] + L[fel. 

Proor. Parrl. Lf{f*] = L{ft] + 

From (VI, 2) we know that there is a sequence of polyhedral mappings p,(X) 
such that p{pn(x), f*(x)} < 1/n’, x E[p,] L{f*] and both | | and ; 
Pn) are less than 1/n’. If then p,(x) f*(x) = Im*(x) = Uc) = w- 
By U(r) we shall understand the set of points in E; at a distance less than r 
from yo. For any integer k consider U(1/k’). There-is a sequence of integers 
Mm < < ms such that p,,(x) U(1/k’) for ¢Q. Discard the poly- 


The monotone map m*(x) = 
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hedral mappings not in the sequence {p,,} and rename the sequence {p,}. It 
is clear that n, 2 k and so 1/n, S 1/k. 

Suppose there are N; triangles in the triangulation 7',, which is now renamed 
T,. Let = 1/(k-N;) and using R = 2/k, r = 2/k obtain py (x) from p,(zx) 
relative to yo, R, r, and «. Then by (VI, ns 


pl pe (x), pe(x)} < +R 
< + 2/k 
< 1/k? + 2/k. 


Moreover, any A of 7; having a point in common with Q is such that p,(A) 
has a point in U(1/n;) and since its diameter is less than 1/ni < 1/K it follows 
that py(A) U(2/k*) = U(r). This implies that pe(A) = yo. 

Let 


* 

For each 7, the mapping Dik (2) is polyhedral (the associated triangulation is 
T;). Furthermore + = Efpg] and = fi(z). 

Hence L{ft] < lim inf E{p%], i = 1, 2. 

Now L{f*] = lim Efp,] = lim Efp¢] = lim inf Efph] + lim inf = 
Lift] + 

Part II: L{f*] Lift] + Life]. 

For each 7 select a sequence of polyhedral mappings Pin(x) with associated 
triangulations 7’, such that p{pin(x), fi(x)} < uniformly in E[pin] 
L{f?] and both | Tin | and 8(T'in Pin) are < 1/n’. 2). 

For each 7 repeat the process of part I to obtain pix(x) such that 


Elpul-; ii + 1/k 


plpir(x), Dia(x)} < 1/k + 2/k 
= WwW, por(A1) 
Define 
pr(t) = pul) if i= 1,2. 


Now pz (x) is a polyhedral mapping and the associated triangulation is essen- 
tially Ty, in A; and Tx in Az. A slight modification must be made in those 
triangles A of Ti which have two sides cut by 2. 

We have = Elpul + and pi(e) =, Hence Lif*] 
lim inf Elpe] = lim + lim = Lift] + Lift}, 
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This completes the proof of the theorem. 
This theorem is the key to a much more general situation in which we have 
additivity. Suppose, for example, that the cactoid 2 associated with a surface 


[f] is a cyclic chain C(a, b). Let a be an arc from a to 6 and suppose a;, --- , a, 
are cut points of C(a, b) in order from a to b along a. Then C(a;, ais:) = C; 
is a cyclic chain from a; to d;4: for? = 0, --- ,n;a@ = a,b = anys. 


Consider f;(X) = lm(C; ; X) for i = 0,---,n. These mappings represent 
surfaces [f;] and we have the 

Corotuary 20.2. Lif] = Llfo] + --- + Lff,l. 

Proor. The argument is by induction on the number of cut points. The 
theorem is obviously true if there are no cut points. Suppose it is true for any 
collection of (n — 1) cut points. Consider the collection of cut points a, --- , 
then we have = L[fo] + + + L{lm(C(anx, 6); X)]. Now 
to the mapping lm(C(a,_1, b); X) apply the theorem with a, replacing c. The 
chain C(a,_1, 6) is broken up into two sub-cactoids C(an1, dn) = Cr4 and 
C(an,b) = Cn. Tf faa = X), fan = lm(C, ; X) then L{lm(C(a,1, b); 
X)| = L{fn+l] + L{f,J, and the assertion is proved. 

Lemma 20.3. If [f] is a surface and the associated cactoid = is an arc then 
L{f] = 0. 

Proor. By 17.1 we can obtain a representation f(X) = lm(X) such that m‘(c) 
is a line of longitude on X unless o is an end point of = and in this exceptional 
event m ‘(c) is either the north or south pole of X. It is now clear that we can 
find polyhedral mappings p,(X) such that p,(x) = f(x) and E[p,] = 0. 

We return to the case of a surface [f] whose associated cactoid = is a single 
cyclic chain C(a, b). Consider an are a as before from a to b and let C*(a, b) 
denote the modified chain consisting of the arc a together with all the 2-spheres 
in C(a, b) of diameter > 1/n. These are finite in number. Let the cut points 
of C(a, b) on these 2-spheres be a, --- , @m. Consider a mapping u,(2) = C, 
which is the identity except on 2-spheres of diameter < 1/n, and on each of 
these is a monotone map of the 2-sphere onto the segment of a it contains leaving 
the cut points fixed. 

Let ,f(X) = lu»m(X); the associated cactoid is C,,(a, b). If we denote a by 
a and b by an41 then with the subchains C,,(a; , a;;) we have partial mappings 
nfi aS in corollary 20.2, for i = 0,---,m. Hence 


= +--+ + 


The mappings ,f;(X) fall into precisely two categories: there are some for 
which the associated cactoid C,,(a; , a:41) is a 2-sphere; there are others for which 
it is an arc. In the second case L,fi] = 0. 

With the original mapping f(X) and cactoid C(a, b) we have the subcactoids 
C(a;, ai4:) and partial mappings f;(X) as in corollary 20.2, for i = 0, ---,m. 
Hence 


= + + Lifnl. 
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The mappings f;(X) also fall into precisely two categories: there are some 
for which the associated cactoid C(a;, ai41) = C,(a;, ais:) is a 2-sphere; then 
there are the remaining mappings which we do not characterize in their own 
right. 

For a mapping f;(X) of the first type fi(X) = ,f:(X) and so L[fi] = L[nfid; 
for a mapping of the second L[{f;] = 0 = Li,fi]. Hence we can draw the con- 
clusion that L[f] = LI,f]. On the other hand it is clear that ,f(z) = f(x) and 
hence lim inf L[,f] 2 Lf]. These two facts imply that L{,f] — Lif]. 

The above discussion shows that if S,, --- , S;, are the 2-spheres of diameter 
>1/n in C(a, b) and g:(X) = lm(S;; X) then L{,f] = Lig) + --- + 
Hence L{f] = limn. {Lig} + --- + 

For terminological convenience we shall record the 2-spheres of = as S,, 
S:, S3,-°:* and the mappings lm(S; ; X) by f:(X), then we have proved the 

Lemma 20.4 L[f]= Lf). . 

We are at last in a position to examine a general mapping f(X) = lm(X) = Y, 
m(X) = 2. Let A be any A-set in 2. The components of 2 — A can be ar- 
ranged in order by size as only a finite number are larger than 1/n, n = 1, 2,3, --- 


Suppose such an arrangement is B,, B., B;,---. For each 7 the set F(B;) 
is a single point b;. Define f(A; X) = lm(A; X), and u,(Z) € = to be the 
mapping which is the identity except on B,,-:--,B.. Ifoe Bj, 1 Sj Sk, 


then u(c) = b;. This is a monotone retraction. Let f,(X) = luxm(X). It 
follows that f.(x) — f(A; x), and so lim inf = LIf(A; X)]. 

As a consequence of 20.1 it is true that L{f] = L{f,| and in general Lif,| = 
Hence Lif] 2 L[f(A; X)]. 

Consider a chain decomposition {C;} = {C(p;, qi)} of = guaranteed by 8.6. 
Using the notation of 8.6 let lm(A, ; X) be denoted by g,(X). The compactness 
of the spaces involved assures us that g,(x) — f(x). Hence lim inf L[g,] 2 L[f]. 
But < Lif]. Hence lim L{g,] = Lif]. 

As for the mapping f(C; ; X) = Im(C; ; X), an argument using 20.1 shows 
that Lign] = ; X)] + + Lif(C, ;X)]. Hence Lif] = LIf(C; ; X)]. 
(Note the proof and discussion of Corollary 20.2) 

If the 2-spheres of = are recorded as S,, S2, Ss, --+ and f;(X) = lm(S; ;X), 
then using Lemma 20.4 we have a proof for the 

Apprtiviry THEOREM 20.5. Lif] = 

In this connection it is interesting to note an elegant result of Radé [8]. 

TuroreM 20.6. If the cactoid = associated with a mapping f(X) is a single 
2-sphere, then L[f] > 0. 

In the event there are 2-spheres in each of the cyclic chains C(p;, qi) 
of Theorem 20.5 then the mappings g,(X) defined above, which converge uni- 
formly to f(X), are such that 


< Lge] < Llgs] Lifl. 
It follows that it is possible to find a sequence of polyhedral mappings p,(x) = 
(x) such that 
< Elp:] < Elp:] < — Lif}. 
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This result is mentioned here because of its independent interest. It would 
be convenient to have the weaker result with < replacing < regardless of the 
structure of 2. 

A second consequence of the theorem of Radé (20.6) should be mentioned. 
In conjunction with the Additivity Theorem 20.5 we have 

TuroreM 20.7. L{f] = 0 if and only if the associated cactoid = is a dendrite; 
has no 2-spheres. 

For a direct proof of this result see Radé [8]. The theorem carries an air of 
fitness in view of the slender structure of a dendrite. 

It should perhaps be mentioned again, as in the beginning of this chapter, 
that these theorems are equally true for ordinary Fréchet surfaces. In addition, 
it is clear that, if f(X) is any mapping, then L[f*] = LIf]. 

A final application of the additivity theorem shows that the following state- 
ment is true. 


20.8. If fiK ~ fe then = 


APPENDIX 


So far the theory developed has been exclusively for surfaces of the type of a 
2-sphere. From the analytic point of view it is surfaces of the type of a 2-cell 
which are more convenient. In this appendix some comments are made con- 
cerning such surfaces. 

Without bothering to redefine all the concepts explicitly it is clear that such 
terms as positively Fréchet equivalent and positively Kerékjart6é equivalent, to 
mention two, carry over at once. In fact, the theorems are equally true for 
surfaces of the type of a 2-cell. 

There are several ways of checking this last statement. Any surface of the 
type of a 2-cell gives rise to a surface of the type of a 2-sphere by a “doubling up” 
process. Explicitly, we may suppose the surface [f] of the type of a 2-cell has a 
representation f whose range is the northern hemisphere of a geometrical sphere. 
If @ is the mapping reflecting the southern hemisphere about the equatorial 
plane onto the northern hemisphere, then f¢ is a mapping which together with f 
gives a composite mapping which is a representation of some surface [f*] of the 
type of a 2-sphere. 2 

It is not difficult to see that L[f*] = 2L[f]. Moreover, the hyperspace = is 
divided symmetrically into two hemicactoids 2, and 2, in an obvious way. In 
this fashion it is an easy exercise to arrive at an additivity theorem which is the 
analog of 20.5. 

In regard to the doubling up technique one should be warned that if fi and fe 
are two maps from 2-cells, and f; and fe are the results of doubling up, then 
though fiF ~ fe (for example) implies {fF ~ f: , the converse is false. This 
imposes some limitations on this mode of attack when applied to the topological 
aspects of the situation. Indeed, for the topological analogs a satisfactory 
technique appears to be a use of the characterization of hemicactoids as in 
Whyburn [10], together with a somewhat stronger approximation and matching 
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theorem. One has to exercise caution at the edge of the 2-cell. Though this 
plan of attack carries through it would be pleasing to reduce the 2-cell case to the 
argument for the 2-sphere by employing a simple artifice. 

It will have been noticed that the fundamental structure Theorem 17.1 is 
essentially used in the application to area only in obtaining a special representa- 
tion for the surface. (We obtained f* from f and f? from fi, t=1,2.) It 
seems likely that a direct approach can be found. 


PurpvE UNIVERSITY. 
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IN GENERAL A MEASURE PRESERVING TRANSFORMATION IS MIXING 


By R. Hatmos 
(Received November 13, 1943) 


In this note I continue the study of topological properties of the group of 
measure preserving transformations begun in an earlier paper.’ _ Using the 
methods and results of that paper I present the first proof of the old standing 
conjecture stated in the title.’ ‘In general” means of course that the exceptional 
set is of the first category in one of the usual “natural” topologies (the strong 
neighborhood topology) for measure preserving transformations. The principal 
new and quite surprising fact used in the proof is that for any almost nowhere 
periodic measure preserving transformation 7’ (and a fortiori for any mixing T’) 
the set of all conjugates of T'(i.e. the set of all ST‘S~*) is everywhere dense. It 
is this possibility of a dense conjugate class in a comparatively well behaved 
topological group (a rather natural generalization of the finite symmetric groups) 
that is contrary to naive intuition. 

Let G be the group of all measure preserving transformations of the unit 
interval.’ For any S eG, measurable set a, and positive number e, write 


N(S) = N(S;a, = {T:| Sa — Ta| < 


A unique topology (called the neighborhood topology) is defined in G by the 
requirement that the collection of all sets of the form N(S) be a subbase for 
open sets.” (With this topology G becomes a complete topological group satis- 
fying the second countability axiom)” A permutation (or more precisely a dyadic 
permutation of rank m) is a measure preserving transformation which maps 
each dyadic interval of rank m onto itself or onto another one by a translation. 
(A dyadic interval of rank m is an interval of the form (k/2”, (&k + 1)/2”),- 
k = 0,1, ---, 2” — 1, and a dyadic set of rank m is a union of such dyadic in- 
tervals.) A dyadic neighborhood is a set of the form {7':| Pa; — Ta;| <«, 
i = 1, ---,n} where P is a permutation and the a; are dyadic sets. The dyadic 


1 Approximation theories for measure preserving transformations, Transactions of the 
A.M. &., vol. 55 (1944), pp. 1-18, In the sequel I shall refer to this paper as (A). 

* See Eberhard Hopf, Ergodentheorie, Berlin, 1937, p. 41. Cf. also (A), §1. The word 
‘mixing’ is used in this paper in the weak sense of Hopf. 

’ It is known that the apparently great restriction to the unit interval is in fact very 
slight: measure theoretically most interesting spaces are isomorphic to each other and 
hence to the unit interval. See Paul R. Halmos and John von Neumann, Operator methods 
in classical mechanics, II, Annals of Math., vol. 43 (1942), Theorems 1 and 2. 

4 The symbol {-— : —} denotes the set of all those objects named before the colon which 
satisfy the condition stated after it. |— | is used to denote the measure of a measurable 
set and a — b standards for the symmetric difference of two sets, a — b = ab’ u a’b, where 
a’ is the complement of a. 

’ See Solomon Lefschetz, Algebraic Topology, New York, 1942,.p. 6. 

6 See (A), Theorems 2 and 3. 
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neighborhoods form a base for the open sets of G. A measure preserving trans- 
formation 7' is almost nowhere periodic (or for brevity non periodic) if the set 
of those £, 0 S  S 1, for which there is a positive integer n such that T"t = &, 
has measure zero. Finally I define the symbols d(S, T) and 8(S, 7’) by the 
formulae 


d(S, T) = | {:S§ Tt} |, 4(S, 7) = sup {| Sa — }. 


In terms of these definitions it is now possible to state the following four results 
(proved in (A)) which will be needed in the sequel. 

(1) For every non periodic T and positive integer n there is a measurable set b 
such that b, Tb, --- , T”*b are pairwise disjoint and 1/n = |b| = 1/(2n — 1). 

(2) Every dyadic neighborhood contains cyclic permutations of arbitrarily large 
ranks. 

(3) If T has (almost everywhere) exactly the period m then there exists a measur- 
able set b such that b, Tb, --- , T”"b are pairwise disjoint and|b| = 1/m. (See 
(A), p. 18). 

(4) Both d(S, T) and 6(S, T) are group invariant metrics for G (giving rise, 
however, to a topology different from cs neighborhood topology) and, for all S and 
T, 6(S, T) = d(S, T). 

Lemma 1. For every non periodic r and positive integer n there exists a measure 
preserving transformation S and a measurable se a such that (i) Sa = a, (ii) S has 
exactly the period nin a, (iit) |a| > 1/2, (iv) d(S, T’) S 2/n, and (v) S is nowhere 
periodic in a’. 

Proor: Choose b in accordance with (1) so that b, Tb, --- , T”~*b are pairwise 
disjoint and |b| = 1/(2n — 1). Write a = bu Thu - then |a| 2 
n/(2n —1) > 1/2. For &ebuTbu---u T””bd define ‘SE = Tt; for 
define SE = T~"**t. It follows that (7), (ci), and (#7) are satisfied: it remains 
to extend the definition of S in such a way as to satisfy (iv) and (v). 

Consider e = a’- Ta’ and Te = a’-Ta’. Since a’ = eu (a’-T""a) anda’ = 
Tevu (a’'- Ta) it follows that « = a’-T™‘a and y = a’-Ta have the same measure. 
For ¢ €e define St = Té; it remains now to define S on xso that Sx = y. Before 
doing this I remark that (iv) is already achieved. This follows from the fact 
that 


~ TE} C (T” bd) u (a’-T™“a), 
whence (using the fact that 7’ is measure preserving and that a’-T™ ‘a C Tb) 
d(S, T) < 2|b| S 2/n. 


7 This result is a strengthening of Lemma 3 of (A) and the first two paragraphs of the 
proof here given are a slight modification of the discussion in (A). A modification in this, 
as well as in several other results and definitions cited from (A), is necessary because the 
language and notation of (A) are considerably more algebraic than those of the present 


paper. 
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It may be assumed without loss of generality that x and y are disjoint, for 
otherwise S may be defined in zy to be any non periodic transformation and thus 
the problem reduces to finding a suitable transformation from zy’ to z'y. This 
assumption makes it possible to draw a schematic sketch of all steps considered 
so far. 


b | | | : 


a a 


Let c be the set of those points ¢ ¢e for which T"é €e for all n = 1, 2,---. 
Since Tc C c, the fact that 7 is measure preserving implies that (after possibly 
discarding a set of measure zero from c) Tc = c. It follows (since c C e) that 
c < Te. In other words almost every point of y gets out of e (and hence into ) 
in a finite number of steps. Write 


= ey, ee fori < k, exh, 


| 


and define for £ «yz, QE = T"t. Let P be any non periodic transformation of y 
into itself and define for ex, SE = PQ“. S is now everywhere defined and 
it remains to prove that it is non periodic. 

If & « Te is such that T"é « Te for all n > O then (by an argument similar to 
that in the preceding paragraph) except for a set of ’s of measure zero é and all 
its 7 transforms must belong to e also and consequently S"£ = T"§. In this 
case the non periodicity of T ensures that of S. Hence it is sufficient to show 
that no £ « y has a finite orbit under S. To prove this take £¢y,. Then since 
Tt eefori = 0,---,& —1, St = Also since Te for i = 1, --- , k, 
Tt,---, Tt do not belong to y. Hence ¢ is not only not periodic of some 
period < k but it does not even return to y in k steps. At the (k + 1)™ step 
(since = T*E = Qéex), = PQ'T"t, whence, using the definition of 
Q, S*tts = Pt. In other words: the first time that a point of y returns to y it is 
transformed by P. Hence the S-orbit of £ contains the (infinite) P-orbit of 
é and cannot therefore be finite. This completes the proof of Lemma 1. 

Lemma 2. For every non periodic T and positive integer n there exists a measure 
preserving transformation S which has (almost) everywhere the period n and for 
which d(S, T) S 4/n. 

Proor: The proof of this lemma is an application of Lemma 1 and consists 
of iterating that former result an infinite number of times. More precisely: 
after one application of Lemma 1 one obtains a set a, and a transformation S; 
with properties there described. The second time Lemma 1 is applied to S; on 
a; to find a set a; Ca; and a transformation S: such that S> has the period n 
on a@,|a2| = (1/2)|a;|, and d(S,, S:) < (2/n)|a,|. Proceeding in this 
way by induction one obtains a sequence of disjoint measurable sets a: , 2, °*° 
and a sequence of measure preserving transformations S,, S2, --- such that 


ij 
i} 
| i} 
bee Af 


for 
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, Si+1) (2/n) | (qu aj)’ | » | | = | (a,u -+-uaj)’ | /2, S,a; = a, 
and S; has period n in a;. It follows also (by induction) that | a u ---u a;| 
> 1/2 +--+ + 1/2 = 1 — 1/2’. Hence the union of all the a’s is (except 
possibly for a set of measure zero) the entire unit interval and therefore a measure 
preserving transformation S of the entire interval may be defined by writing, 
for £¢a;, SE = Si. S will have almost everywhere the period n and, moreover, 


d(T, S) d(T, S:) + d(Si, S2) + d(S2, Ss) + 
(2/n)(1 + 1/2 + 1/2? + 1/2 +--+) = 4/n. 


With these preliminary results out of the way I am now in a position to state 
and prove the first main theorem of this paper. 

THEOREM 1. The conjugate class of any non periodic measure preserving trans- 
formation To ¢G is (in the neighborhood topology) everywhere dense in G. 

Proor: Let N = N(P) = {T: | Pa; — Ta;| < ¢,i = 1, --- , n} be any dyadic 
neighborhood of some permutation P: it is to be proved that for some R ¢G, 
RTR¢N. Write M = {T:| Pa; — Ta;| < ¢/2,i = 1, ---, n} and apply 
to M the result (2) quoted from (A). It follows that M contains a cyclic per- 
mutation Q of rank m greater than the ranks of all a; and such that 1/2"~ < «. 
Then Lemma 2 may be applied to 7 to obtain an S which has (almost) every- 
where the period 2” and for which d(S, T>) < 4/2” < ¢/2. 

Observe next that Q and S are conjugate in G. For denote by q@, -** , qt 
(r = 2”) the dyadic intervals of rank m, so arranged that Qqi = q+: (¢ mod r) 
and denote by 8 a set of measure 1/2” which is disjoint from S"s fori = 1, ---, 
r—1. (The existence of such an % is guaranteed by (3)). Write s; = S's, 
i=1,---,r—1, and let R be any (measure preserving) transformation taking 
s into go. To extend the definition of R write, for any £es;, RE = Q'RS™“E. 
Schematically : 


S S 
So —— 81 —— S82 —— Sr-1 
— — G2 °° * 
Q 


It is quite easy to see that Q = RSR™. 


The proof can now be quickly finished. Since d(S, To) < ¢/2 and since (by 
(4)) d is invariant under the group operations, 
d(Q, RTR™) = d(RSR™, RTR™) = d(S, To) < €/2. 
Using the definition of 5 and the inequality stated in (4) it follows that 
| Pa; — RTR'a;| < | Pa; — Qa; | + | Qa: RT.R™a; | 
< | Pa; — Qa; | + 8(Q, RTR™) 
< d(Q, RTR™) < «. 
Consequently N(P); q.e.d. 
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To establish the residual character of any set M of measure preserving trans- 
formations it is sufficient, in view of the theorem just proved, to show that (i) 
M contains a non periodic transformation, (ii) M is self conjugate, and (iii) M 
isa G;.° In the sequel I shall show that the set M of mixing transformations 
has these three properties. 

Recall first the definition of mixing. A measure preserving transformation is 
mixing if there exists a set Jo of positive integers of density zero such that for 
any two measurable sets a and b 


limag, | T’a-b| = |a|- |b]. 


The properties (i) and (ii) of M are easy consequences of this definition. (i) 
Not only does M contain non periodic transformations but in fact every mixing 
T is non periodic.” For if the set of those whose period is say m had positive 
measure then (3) (applied to 7 considered on this set only) would yield a set c 
of positive measure with the properties T’”c = c, T’e-c = 0 for 0 <j <m. 
Writing a. = b = ¢, it follows that | T"a-b| =|c]|or 0 according as n = 0 
(mod m) or not. Since this sort of behavior clearly excludes the possibility of 
the limiting property of mixing transformations it follows that a 7 which is 
periodic on a set of positive measure cannot be mixing. (ii) If 7 is mixing and 
S = RTR™ then S is mixing. For 
| S"a-b| = | (RTR™)"a-b | = | RT"Ra-b | 

= | | = | |; 
the limiting property of 7' (with a, b replaced by Ra, R™'b) together with the 
fact that R is measure preserving yields the desired conclusion. 

To prove (iii), i.e. to prove that M is a G;, it is necessary to use the known 
Hilbert space characterization of mixing. A measure preserving transformation 
T may be regarded as a unitary operator on the complex Hilbert space Le , by 
defining Tf(¢) = f(T). (The unitary character of 7’ follows from the fact that 
measure and, consequently, integral are invariant.) Mixing in Hilbert space 
terms amounts to the existence of a set J of positive integers of density zero 
such that for any two functions f, g € Le , 


lim n (T”. g) (f, 1)(1, g) 
1 
(where f, g) is the inner product I f(&g(é) dé and 1 stands for the function 


identically equal to 1, so that (f, 1) = [ f(é) dé and (1, g) = [ g(é) dé.) 


8 Cf. the proof of Theorem 6, (A). 

* See Hopf, op. cit., p. 36. For all concepts and results quoted in this paragraph and 
the next the reader is selerred to Chapter III of Hopf’s book. 

” To see that this statement is really stronger than the original formulation of (i) it is 
necessary to make the non trivial observation that there exists at least one mixing trans- 
formation. See for example Paul R. Halmos, On wera Hi sae of compact groups, Bulle- 
tin of the A. M. S., vol. 49 (1943), pp. 619-624. 
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From the point of view of the preceding paragraph the set G of all measure 
preserving transformations becomes a subset of the set U of all unitary operators. 
On this latter set von Neumann’s “strong neighborhood topology” may be 
defined: a subbasic neighborhood of Ty «U is of the form 


N(To) = N(To;f, = {T:|| Tof — Tf || < ¢}, 


where f € > 0, and || f || stands, as usual, for (f, f)"”. I shall need to make 
use of the fact that this topology when specialized to the subset G of U coincides 
with the neighborhood topology defined earlier. (This fact is proved by showing 
that a topology equivalent to the “strong neighborhood topology” of von 
Neumann is obtained if f is restricted to be the characteristic function of a 
measurable set.”) Two further facts will be used in the proof that follows: 
first that for fixed f, g « Le the function ¢(7) = , g) is a continuous function 
of 7 (in the neighborhood topology) and, last but far from least, that 7' is mixing 
if and only if it has no non trivial proper values, i.e. Tf = Af (f # 0) implies that 
fis (almost everywhere) equal to the constant \ = 1. (This last fact is known 
as the mixing theorem.”) 

The remainder of the proof (that M is a G;) is somewhat analogous to the 
proof of Theorem 6 of (A). Let fi, fe, --- be a dense sequence in Lz ; write 


j, m, n) {T: | (T"fs fi) (fi, 1)(1, | < 1/m} 


and 
F = NiNf.U,E(G, j, m, n). 


The fact that (7'f, g) is a continuous function of T implies that H(i, 7, m, n) is 
open and therefore that F isa G;. The limiting property of mixing 7’s shows 
that M C F: I shall now show that, conversely, if 7’ is not mixing then T' is 
not in 

If T is not mixing then, by the mixing theorem, there is an f ¥ 0 and a com- 
plex number \ of modulus one (since 7’ is unitary) such that Tf = Af. Without 
loss of generality it may be assumed that f is orthogonal to the trivial proper 
function 1, (f, 1) = 0, and that f is normalized, || f || = 1. Choose now an i for 
which || f — fi || < .1 and choose j = i, m = 2. I shall show that T' does not 
belong to F. Observe first that || fi || < || + || fi: 1.1. It follows 
that. 


| 
S| (1'"f, f) fi) | + | (T” Ji) (T" iy Si) | | (T”. i+ Si) (fi, 1)(1, fa)| 
+ | (fi, 1)(1, fi) (fi, 1)(1, f) | + | (fi, 1)(1,f) (f, 1)(1, f) | 


11 See John von Neumann, Zur Algebra der Funktionaloperationen und Theorie der nor- 
malen Operatoren, Math. Ann., vol. 102 (1929), p. 386. 
2 Hopf, op. cit., p. 37. 
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S14 11+ 1+ | (1, f) (fi, 10, fo | 
< 5 + | f) Gi, U0, fad |. 
In other words 
— Fe, D0, | > 5 = 1/m 


for all n. Hence: 


THEoREM 2. In the neighborhood topology of G the set M of mixing transforma- 
tions is a residual G;. 
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MEASURING THE SURFACE AREA OF A CONVEX BODY 


By P. A. P. Moran 
(Received March 20, 1944) 


In applied sciences it is sometimes desirable to measure the surface area of a 
small body. It is difficult to do this directly, but if the body is convex we may 
use a well-known theorem on convex bodies. If K is such a body in three- 
dimensional space, let S be its surface area, A(n) the area of its projection on a 
plane perpendicular to the vector.n, and A the mean of A(n) for all directions 
of n. Then the theorem states 


S = 4A. 


To obtain a value for this mean in practice, we may take the average of the 
projections in a finite number of directions, and the question then arises as to 
the error which may result in such a process. In what follows we suppose the 
projected area to be measured in a set of directions which is in one case the set of 
directions of the normals to the faces of a dodecahedron, and in a second case 
the set of directions of the normals to the faces of an icosahedron. We shall 
then find exact upper and lower bounds for the ratio between the estimated 
area and the exact area. In practice the area of the projection might be meas- 
ured by measuring the shadow of the body when placed in the path of a suitably 
directed beam of light, or by observing the body through a suitably directed 
optical instrument in whose eyepiece is placed a graticule ruled in squares. For 
a bibliography on the experimental side, see Markwick.’ 

H. Steinhaus’ has considered an analogous problem concerning a convex figure 
ina plane. If F is such a figure and f(@) is the length of its projection on a line 
making an angle @ with a fixed direction, it is known that the length of the 
perimeter of F is + times the mean of f(@) for all 6. Steinhaus replaces this 
mean by the average of the values of f(@) for six different values separated by 
intervals of 30°, and then finds exact upper and lower bounds for the ratio be- 
tween the estimated length and the true length. His method is trigonometrical, 
and does not extend readily to the three-dimensional case. We employ a vec- 
torial method, and the ideas of this method enable us to extend readily Stein- 
haus’s result to the case of an arbitrary number of equally spaced directions in 
a plane. 

We begin by considering the dodecahedral case. Let 7 be an element of area 
whose normal is a unit vector 1. We suppose 7’ to be of unit area and to be 
placed at the centre of a dodecahedron so that its projection in the direction of 
the normals to the twelve faces can be considered. Let these faces be numbered 
from 1 to 12. Let the line of the vector | intersect the face 1, and suppose the 


1A. H. D. Markwick, Chemistry and Industry, XV, 206 (1937). 
2H. Steinhaus, Akad. d. wiss. Leipzig, Ber., 82, 120-130 (1930). 
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faces adjoint to this face be 2, 3, --- 6 (Fig. 1). Let the faces opposite to 
1, 2, --- 6 be 7, 8, --- 12 respectively. Let l,, --- le be unit vectors directed 
from the centre of the dodecahedron in the directions of the normals to the 
faces 1, 2, --- 12. Then the mean of the areas of the projections of the element 
T in the twelve directions is given by 


1 12 


since each term in the sum is the area of the projection in the direction of the 
corresponding vector. We now find upper and lower bounds for this sum. Let 
the face 1 have vertices A, B, C, D, E, where CD, DE, EA, AB and BC are edges 
in common with the faces 2, 3, 4, 5 and 6 respectively, and let the middle points 
of CD, DE, EA, AB and BC be A’, B’, C’, D’, E’. Then two cases arise. 


Fig. 1 


I. The line of the vector | intersects the dodecahedron in a point inside the 
pentagon A’B’C’D’E’. 

If this is true the vectors |, , - - - 1, all lie on one side of the plane of the element 
of unit area, and the vectors I, , --- lh: on the other. By symmetry we can 
clearly confine ourselves to those on the same side as the vector I. Then the 
mean of the projected areas is 
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6 
Now the vector 21, is a vector in the direction of , and has magni- 


) the tude 1 + 5 cos @, where @ is the angle between the vectors associated with 
nent adjacent faces of the dodecahedron. A little calculation shows this angle to 
be 63.435°, and so cos @ = .4472. Then 


6 
> |1-1;| = % cos e(1 + 5 cos @) 
the where ¢ is the angle between the vectors] and1,. The largest attainable value 
Let of this expression is 
iges K; = 3(1 + 5 cos 6) 
ints = 5393 
and the smallest is 


= (0/2)(1 + 5 cos 6) 
= .4588. 


The area of one side of the element of area will be twice the mean projected area, i 
and so if S; is the estimated area and S the true area, we have ) 


918 = 2K, <  < 2K, = 1.079. 
Moreover these limits are clearly the best possible. 


II. The line of the vector | intersects the dodecahedron in a point outside | 
the pentagon A’B’C’D’E’ and inside the pentagon ABCDE. 

We can suppose by symmetry that the point of intersection with the dodeca- 
hedron is within the triangle CA’E’. Then the vectors 1,k, kh, 1s, ls and lo ‘Wy 
lie on one side of the plane of the element of area. The mean of the projections | 
in all twelve directions is i 


12 


e +b + ho} 

= 41-L 
where L is a vector which is the sum of the vectors in the bracket and lies in the 
direction from the centre of the dodecahedron to the point C. Now the mini- 


mum of the above expression occurs when the vector I points at A’ or E’, i.e. 
in the same direction as gave the minimum in case 1, and therefore giving the 
same result. We need therefore only calculate the maximum value of 41-L, ant 
which is §|L| and 
|L| = 3(cos ¢: + cos 


where ¢; is the angle between |, and L, and ¢» is the angle between |, and L. 
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A little calculation shows that 
3|L| = .984 


and we have in this case 


St < 984 < 1.079 
and therefore in whatever direction the vector normal to the element lies we find 
that the inequality of case I holds. The surface of any convex body can be 
considered as made up of infinitesimal elements, and so the above inequality 
holds for any convex body and the result is the best possible, since the bounds 
can actually be obtained when the convex body is flat. 

We now consider the similar problem where we measure the projections in the 
directions of the normals to the faces of an icosahedron. Let the unit vectors 
in the direction of the faces be 1, --- ko. Suppose once again we have an 
element of unit area at the centre of the icosahedron and that the unit vector in 
the direction of the normal to the element of unit area is 1. Let the line of the 
vector | intersect the face ABC, and suppose this face is numbered 1 corre- 
sponding to the vector 1. Let the faces having BC, CA and AB in common 
with ABC be numbered 2, 3, 4 respectively, and similarly for the corresponding 
vectors (Fig. 2). Let the other vectors associated with the faces having angles 
at the vertices A, B and C bel, and k, , , and ly» respectively. Let the 
vectors associated with the faces opposite to the faces 1, --- 10 be lu, --- ho 
respectively. Let A’, B’, C’ be the centres of the edges BC, CA and AB, and 
suppose A’S is a line on the face ABC such that it lies in a plane through the 
centre of the icosahedron perpendicular to the vector ];. Similarly, let A’P, 
B'M, B’R, C’Q, C’L be lines lying in planes perpendicular to k, b, ls, ls, ho. 
Let B’R and C’Q intersect in D, A’P and B’M in LE, and C’L and A’S in F. 
Then DC’FA’EB’ is an irregular hexagon. By symmetry it is sufficient to 
consider the following cases— 

1. The line of the vector I intersects the face ABC within the irregular hexagon 
DC'FA'EB’. Then the vectorsh,k,b,h,kb,kh,k, Is, Is, ho all lie on one 
side of the plane of the element of area and the sum 


20 10 10 

where L, is a vector from the centre of the icosahedron in the direction ], , and 
having the magnitude 


1 + 3 cos a + 6 cos 8 


where a is the angle between the vectors l, and k, , and 8 is the angle between the 
vectors 1, and 1;. Calculation shows that cos a = .7465 and cos 8 = .3333. 
Then 


21-L = 2 cos e(1 + 3 cos a + 6 cos 8) 
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where ¢ is the angle between | and L. The maximum value of this product is 
2(1 + 3 cos a + 6 cos 8) 
and the minimum value 
Qe0s 6:(1 + 3 cos a + 6 cos 


where 6; is the angle between }, and the line from the centre of the icosahedron 
to the point D (or E, or F) for it is easy to see that the point D is further from 
the centre of the triangle than A’, B’ or C’. Then if the conditions of case I 


Fig. 2 


are satisfied the ratio of the estimated to the true value of the element of area 
lies between 


(1 + 3 cos a + 6 cos 8) 
and 
3 cos 6,(1 + 3 cos a + 6 cos 8) 


and some calculation shows these to be 1.048 and .957 respectively. 


2. The line of the vector I intersects the face ABC in the lozenge shaped 
region ARDQ. Then the vectorsh,h,b,l,b,b,h, ho, hs and lp lie on 
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one side of the element of area and the remainder on the other side, and we can 
write 


20 
| = +h +b th +h +b th + bo + bs + 
= 21-L, 


where 


+h + ho + hs + 


and is therefore a vector from the centre of the icosahedron in the direction of 
the point A, and has magnitude . 


5(cos y + cos 4) 


where y is the angle between 1, and Lz and 4 is the angle between |, and L,. 
Calculation shows the value of this to be less than the maximum value of the 
ratio between the estimate of the area and the true value in the first case. The 
minimum for the ratio will occur when | is pointing in the direction of the point 
D, and the value of the minimum is therefore the same as in case I. 


3. The line of the vector | intersects the face ABC in the triangle RDC’. 
Then the vectors k,l ,h Is, ho and ly all lie on one side of the 
element of area. Then we can write 


20 
+h +h +e + bo + bs} 


= 21-{L. + 2ls} 
= 21- {Ls} 


where 


L; = + 2k. 


A little calculation shows the absolute value of Ls to be less than that of L,, 
and so the maximum value of the ratio of the estimated to the true area is less 
in this case than in case I. Moreover the minimum will occur when the vector 
1 points in the direction of some point on RD or DC’ and so will not be smaller 
than the minimum in case I or II. Consequently in any case the ratio of the 
estimated to the true area will lie between 1.048 and .957. By summing all the 
elements of the surface of a convex body the same result is seen to hold for the 
latter, and this completes the discussion. 

Similar methods can be applied to extend the results of Steinhaus for convex 
figures in a plane. Suppose that we measure the projection of such a body in 
n evenly spaced directions separated from one another by angles equal to 2x/n. 
Two cases arise according as to whether n is even or odd. If n is odd the result 
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is the same as measuring the body in 2n evenly spaced directions. We there- 
fore need only consider the latter case, and by applying an argument similar 
to the above we show very simply that the ratio of the estimated perimeter to 
the true perimeter must be between 2/(2n sin /2n) and (x cos 2/2n)/ 


(2n sin +/2n). 
For n = 6, these limits are 1.012 and .977 
forn = 10, 1.004 and .992 
forn = 20, 1.001 and .998. 


These results also have applications to practical problems. 
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